Figure 1. ConstfucUon of a pC5 H6p WNV prM-M-E donor plasmid, pDS-2946-1-1 
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Figure 2. Sequence of C5 H6p WNV prM-M-E C5 in pDS-2646-1-1 

=>C5R 

1 TGAATGTTAA ATGTTATACT TTGGATGAAG CTATAAATAT GCATTGGAAA AATAATCCAT 

61 TTAAAGAAAG GATTCAAATA CTACAAAACC TAAGCGATAA TATGTTAACT AAGCTTATTC 

121 TTAACGACGC TTTAAATATA CACAAATAAA CATAATTTTT GTATAACCTA ACAAATAACT 

181 AAAACATAAA AATAATAAAA GGAAATGTAA TATCGTAATT ATTTTACTCA GGAATGGGGT 

241 TAAATATTTA TATCACGTGT ATATCTATAC TGTTATCGTA TACTCTTTAC AATTACTATT 

301 ACGAATATGC AAGAGATAAT AAGATTACGT ATTTAAGAGA ATCTTGTCAT GATAATTGGG 

361 TACGACATAG TGATAAATGC TATTTCGCAT CGTTACATAA AGTCAGTTGG AAAGATGGAT 

421 TTGACAGATG TAACTTAATA GGTGCAAAAA TGTTAAATAA CAGCATTCTA TCGGAAGATA 

481 GGATACCAGT TATATTATAC AAAAATCACT GGTTGGATAA AACAGATTCT GCAATATTCG 

541 TAAAAGATGA AGATTACTGC GAATTTGTAA ACTATGACAA TAAAAAGCCA TTTATCTCAA 

601 CGACATCGTG TAATTCTTCC ATGTTTTATG TATGTGTTTC AGATATTATG AGATTACTAT 

661 AAACTTTTTG TATACTTATA TTCCGTAAAC TATATTAATC ATGAAGAAAA TGAAAAAGTA 

721 TAGAAGCTGT TCACGAGCGG TTGTTGAAAA CAACAAAATT ATACATTCAA GATGGCTTAC 

781 ATATACGTCT GTGAGGCTAT CATGGATAAT GACAATGCAT CTCTAAATAG GTTTTTGGAC 

841 AATGGATTCG ACCCTAACAC GGAATATGGT ACTCTACAAT CTCCTCTTGA AATGGCTGTA 

901 ATGTTCAAGA ATACCGAGGC TATAAAAATC TTGATGAGGT ATGGAGCTAA ACCTGTAGTT 

961 ACTGAATGCA CAACTTCTTG TCTGCATGAT GCGGTGTTGA GAGACGACTA CAAAATAGTG 

1021 AAAGATCTGT TGAAGAATAA CTATGTAAAC AATGTTCTTT ACAGCGGAGG CTTTACTCCT 

1081 TTGTGTTTGG CAGCTTACCT TAACAAAGTT AATTTGGTTA AACTTCTATT GGCTCATTCG 

1141 GCGGATGTAG ATATTTCAAA CACGGATCGG TTAACTCCTC TACATATAGC CGTATCAAAT 

1201 AAAAATTTAA CAATGGTTAA ACTTCTATTG AACAAAGGTG CTGATACTGA CTTGCTGGAT 

1261 AACATGGGAC GTACTCCTTT AATGATCGCT GTACAATCTG GAAATATTGA AATATGTAGC 

1321 ACACTACTTA AAAAAAATAA AATGTCCAGA ACTGGGAAAA ATTGATCTTG CCAGCTGTAA 

1381 TTCATGGTAG AAAAGAAGTG CTCAGGCTAC TTTTCAACAA AGGAGCAGAT GTAAACTACA 

1441 TCTTTGAAAG AAATGGAAAA TCATATACTG TTTTGGAATT GATTAAAGAA AGTTACTCTG 

1501 AGACACAAAA GAGGTAGCTG AAGTGGTACT CTCAAAAAGG TACGTGACTA ATTAGCTATA 

1561 AAAAGGATCC GGGTTAATTA ATTAGTCATC AGGCAGGGCG AGAACGAGAC TATCTGCTCG 



=> H6p 

1621 TTAATTAATT AGAGC TTCTT TATTCTATAC TTAAAAAGTG AAAATAAATA CAAAGGTTCT 
1681 TGAGGGTTGT GTTAAATTGA AAGCGAGAAA TAATCATAAA TTATTTCATT ATCGCGATAT 

=> WNV caps id leader 
M TGI A V M I G L I A S V 
1741 CCGTTAAGTT TGTATCGTAA TGACCGGAAT TGCAGTCATG ATTGGCCTGA TCGCCAGCGT 

=> WNV prM 

. G A V TLSN FQG KVM MTVN ATD 
1801 AGGAGCAGTT ACCCTCTCTA ACTTCCAAGG GAAGGTGATG ATGACGGTAA ATGCTACTGA 

.VTD VITI PTA AGK NLCI VRA 
1861 CGTCACAGAT GTCATCACGA TTCCAACAGC TGCTGGAAAG AACCTATGCA TTGTCAGAGC 

.MDV GYMC DDT ITY ECPV LSA 
1921 AATGGATGTG GGATACATGT GCGATGATAC TATCACTTAT GAATGCCCAG TGCTGTCGGC 

.GN.D PEDI DCW CTK SAVY VRY 
1981 TGGTAATGAT CCAGAAGACA TCGACTGTTG GTGCACAAAG TCAGCAGTCT ACGTCAGGTA 

=> WNV M 

.GRC TKTR HSR RSR R S L T V Q T 
2041 TGGAAGATGC ACCAAGACAC GCCACTCAAG ACGCAGTCGG AGG TCACTGA CAGTGCAGAC 

. H G E S T L A N K K G A W MUST K A T 
2101 ACACGGAGAA AGCACTCTAG CGAACAAGAA GGGGGCTTGG ATGGACAGCA CCAAGGCCAC 

. R Y L V K T E S W I L R N P G Y A L V A 
2161 AAGGTATTTG GTAAAAACAG AATCATGGAT CTTGAGGAAC CCTGGATATG CCCTGGTGGC 

.AVI G W M L G S N T M Q R V V F V V L 
2221 AGCCGTCATT GGTTGGATGC TTGGGAGCAA CACCATGCAG AGAGTTGTGT TTGTCGTGCT 

=:> WNV E 

.LLL V A P A Y S F NCL GMSN RDF 
2281 ATTGCTTTTG GTGGCCCCAG CTTACAGCTT CAACTGCCTT GGAATGAGCA ACAGAGACTT 

.LEG VSGA TWV DLV LEGD SCV 
2341 CTTGGAAGGA GTGTCTGGAG CAACATGGGT GGATTTGGTT CTCGAAGGCG ACAGCTGCGT 

.TIM SKDK PTI DVK MMNM EAA 
2401 GACTATCATG TCTAAGGACA AGCCTACCAT CGATGTGAAG ATGATGAATA TGGAGGCGGC 

.NLA EVRS YCY LAT VSDL STK 
2461 CAACCTGGCA GAGGTCCGCA GTTATTGCTA TTTGGCTACC GTCAGCGATC TCTCCACCAA 

.AAC PTMG EAH NDK RADP AFV 
2521 AGCTGCGTGC CCGACCATGG GAGAAGCTCA CAATGACAAA CGTGCTGACC CAGCTTTTGT 

.CRQ GVVD RGW GNG CGLF GKG 
2581 GTGCAGACAA GGAGTGGTGG ACAGGGGCTG GGGCAACGGC TGCGGACTAT TTGGCAAAGG 

.SID TCAK FAC STK AIGR TIL 
2641 AAGCATTGAC ACATGCGCCA AATTTGCCTG CTCTACCAAG GCAATAGGAA GAACCATCTT 



mutated T5NT 

.KEN IKYE VAI FVH GPTT VES 
2701 GAAAGAGAAT ATCAAGTACG AAGTGGCC AT CTTCGTG CAC GGACCAACTA CTGTGGAGTC 

.HGN YSTQ VGA TQA GRFS ITP 
2761 GCACGGAAAC TACTCCACAC AGGTTGGAGC CACTCAGGCA GGGAGATTCA GCATCACTCC 

.AAP SYTL KLG EYG EVTV DCE 
2821 TGCGGCGCCT TCATACACAC TAAAGCTTGG AGAATATGGA GAGGTGACAG TGGACTGTGA 

.PRS GIDT NAY YVM TVGT KTF 
2881 ACCACGGTCA GGGATTGACA CCAATGCATA CTACGTGATG ACTGTTGGAA CAAAGACGTT 

.LVH REWF MDL NLP WSSA GST 
2941 CTTGGTCCAT CGTGAGTGGT TCATGGACCT CAACCTCCCT TGGAGCAGTG CTGGAAGTAC 

.VWR NRET LME FEE PHAT KQS 
3001 TGTGTGGAGG AACAGAGAGA CGTTAATGGA GTTTGAGGAA CCACACGCCA CGAAGCAGTC 

.VIA LGSQ EGA LHQ ALAG AIP 
3061 TGTGATAGCA TTGGGCTCAC AAGAGGGAGC TCTGCATCAA GCTTTGGCTG GAGCCATTCC 

.VEF SSNT VKL TSG HLKC RVK 
3121 TGTGGAATTT TCAAGCAACA CTGTCAAGTT GACGTCGGGT CATTTGAAGT GTAGAGTGAA 

.MEK LQLK GTT YGV CSKA FKF 
3181 GATGGAAAAA TTGCAGTTGA AGGGAACAAC CTATGGCGTC TGTTCAAAGG CTTTCAAGTT 

.LGT PADT GHG TVV LELQ YTG 
3241 TCTTGGGACT CCCGCAGACA CAGGTCACGG CACTGTGGTG TTGGAATTGC AGTACACTGG 

.TDG PCKV PIS SAA SLND LTP 
3301 CACGGATGGA CCTTGCAAAG TTCCTATCTC GTCAGCGGCT TCATTGAACG ACCTAACGCC 

.VGR LVTV NPF VSV ATAN AKV 
3361 AGTGGGCAGA TTGGTCACTG TCAACCCTTT TGTTTCAGTG GCCACGGCCA ACGCTAAGGT 

.LIE LEPP FGD SYI VVGR GEQ 
3421 CCTGATTGAA TTGGAACCAC CCTTTGGAGA CTCATACATA GTGGTGGGCA GAGGAGAACA 

.QIN HHWH KSG SSI GKAF TTT 
3481 ACAGATCAAT CACCATTGGC ACAAGTCTGG AAGCAGCATT GGCAAAGCCT TTACAACCAC 

.LKG AQRL AAL GDT AWDF GSV 
3541 CCTCAAAGGA GCGCAGAGAC TAGCCGCTCT AGGAGACACA GCTTGGGACT TTGGATCAGT 

.GGV FTSV GKA VHQ VFGG AFR 
3601 TGGAGGGGTG TTCACCTCAG TTGGGAAGGC TGTCCATCAA GTGTTCGGAG GAGCATTCCG 

.SLF GGMS WIT QGL LGAL LLW 
3661 CTCACTGTTC GGAGGCATGT CCTGGATAAC GCAAGGATTG CTGGGGGCTC TCCTGTTGTG 

.MGI NARD RSI ALT FLAV GGV 
3721 GATGGGCATC AATGCTCGTG ATAGGTCCAT AGCTCTCACG TTTCTCGCAG TTGGAGGAGT 

.LLF LSVN VHA 
3781 TCTGCTCTTC CTCTCCGTGA ACGTGCACGC TTAATTTTTA TCTAGAATCG ATCCCGGGTT 



^ C5L 



3841 


TTTATGACTA 


GTTAATCACG GCCGCCTTAT 


AAAGATCTAA 


AATGCATAAT 


TTCTAAATAA 


3901 


TGAAAAAAAG 


TACATCATGA GCAACGCGTT 


AGTATATTTT 


ACAATGGAGA 


TTAACGCTCT 


3961 


ATACCGTTCT ATGTTTATTG ATTCAGATGA 


TGTTTTAGAA 


AAGAAAGTTA 


TTGAATATGA 


4021 


AAACTTTAAT 


GAAGATGAAG ATGACGACGA 


TGATTATTGT 


TGTAAATCTG 


TTTTAGATGA 


4081 


AGAAGATGAC 


GCGCTAAAGT ATACTATGGT 


TACAAAGTAT 


AAGTCTATAC 


TACTAATGGC 


4141 


GACTTGTGCA 


AGAAGGTATA GTATAGTGAA 


AATGTTGTTA 


GATTATGATT 


ATGAAAAACC 


4201 


AAATAAATCA 


GATCCATATC TAAAGGTATC 


TCCTTTGCAC 


ATAATTTCAT 


CTATTCCTAG 


4261 


TTTAGAATAC 











Rgure 3. Construction of a pF8 H6p WNV prM-M-E donor plasmid, pSL-5513-1-1-1. 
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Figure 4. Sequence of F8 H6p WNV prM-M-E F8 in pSL-5513-1-1-1. 

=> F8R 

1 GACCCTTTAC AAGAATAAAA GAAGAAACAA CTGTGAAATA GTTTATAAAT GTAATTCGTA 

61 TGCAGAAAAC GATAATATAT TTTGGTATGA GAAATCTAAA GGAGACATAG TTTGTATAGA 

121 CATGCGCTCT TCCGATGAGA TATTCGATGC TTTTCTAATG TATCATATAG CTACAAGATA 

181 TGCCTATCAT GATGATGATA TATATCTACA AATAGTGTTA TATTATTCTA ATAATCAAAA 

241 TGTTATATCT TATATTACGA AAAATAAATA CGTTAAGTAT ATAAGAAATA AAACTAGAGA 

301 CGATATTCAT AAAGTAAAAA TATTAGCTCT AGAAGACTTT ACAACGGAAG AAATATATTG 

361 TTGGATTAGT AATATATAAC AGCGTAGCTG CACGGTTTTG ATCATTTTCC AACAATATAA 

421 ACCAATGAAG GAGGACGACT CATCAAACAT AAATAACATT CACGGAAAAT ATTCAGTATC 

481 AGATTTATCA CAAGATGATT ATGTTATTGA ATGTATAGAC GGATCTTTTG ATTCGATCAA 

541 GTATAGAGAT ATAAAGGTTA TAATAATGAA GAATAACGGT TACGTTAATT GTAGTAAATT 

601 ATGTAAAATG CGGAATAAAT ACTTTTCTAG ATGGTTGCGT CTTTCTACTT CTAAAGCATT 

661 ATTAGACATT TACAATAATA AGTCAGTAGA TAATGCTATT GTTAAAGTCT ATGGTAAAGG 

721 TAAGAAACTT ATTATAACAG GATTTTATCT CAAACAAAAT ATGATACGTT ATGTTATTGA 

781 GTGGATAGGG GATGATTTTA CAAACGATAT ATACAAAATG ATTAATTTCT ATAATGCGTT 

841 ATTCGGTAAC GATGAATTAA AAATAGTATC CTGTGAAAAC ACTCTATGCC CGTTTATAGA 

901 ACTTGGTAGA TGCTATTATG GTAAAAAATG TAAGTATATA CACGGAGATC AATGTGATAT 

961 CTGTGGTCTA TATATACTAC ACCCTACCGA TATTAACCAA CGAGTTTCTC ACAAGAAAAC 

1021 TTGTTTAGTA GATAGAGATT CTTTGATTGT GTTTAAAAGA AGTACCAGTA AAAAGTGTGG 

1081 CATATGCATA GAAGAAATAA ACAAAAAACA TATTTCCGAA CAGTATTTTG GAATTCTCCC 

1141 AAGTTGTAAA CATATTTTTT GCCTATCATG TATAAGACGT TGGGCAGATA CTACCAGAAA 

1201 TACAGATACT GAAAATACGT GTCCTGAATG TAGAATAGTT TTTCCTTTCA TAATACCCAG 

1261 TAGGTATTGG ATAGATAATA AATATGATAA AAAAATATTA TATAATAGAT ATAAGAAAAT 

1321 GATTTTTACA AAAATACCTA TAAGAACAAT AAAAATATAA TTACATTTAC GGAAAATAGC 

1381 TGGTTTTAGT TTACCAACTT AGAGTAATTA TCATATTGAA TCTATATTGC TAATTAGCTA 

1441 ATAAAAACCC GGGTTAATTA ATTAGTCATC AGGCAGGGCG AGAACGAGAC TATCTGCTCG 

=^ H62 

1501 TTAATTAATT AGAGCTTCTT TATTCTATAC TTAAAAAGTG AAAATAAATA CAAAGGTTCT 



1561 



TGAGGGTTGT GTTAAATTGA AAGCGAGAAA TAATCATAAA TTATTTCATT ATCGCGATAT 



=> WNV caps id leader 
M TGI A V M I G L I A S V 
1621 CCGTTAAGTT TGTATCGTAA TGACCGGAAT TGCAGTCATG ATTGGCCTGA TCGCCAGCGT 

=> WNV prM 

.GAV TLSN FQG KVM MTVN ATD 
1681 AGGAGCGGTT ACCCTCTCTA ACTTCCAAGG GAAGGTGATG ATGACGGTAA ATGCTACTGA 

•VTD VITI PTA AGK NLCI VRA 
1741 CGTCACAGAT GTCATCACGA TTCCAACAGC TGCTGGAAAG AACCTATGCA TTGTCAGAGC 

.MDV GYMC DDT ITY ECPV LSA 
1801 AATGGATGTG GGATACATGT GCGATGATAC TATCACTTAT GAATGCCCAG TGCTGTCGGC 

.GND PEDI DCW CTK SAVY VRY 
1861 TGGTAATGAT CCAGAAGACA TCGACTGTTG GTGCACAAAG TCAGCAGTCT ACGTCAGGTA 

^ WNV M 

.GRC TKTR HSR RSR R S L T V Q T 
1921 TGGAAGATGC ACCAAGACAC GCCACTCAAG ACGCAGTCGG AGGTCACTGA CAGTGCAGAC 

. H G E S T L A N K K G A W M D S T K A T 
1981 ACACGGAGAA AGCACTCTAG CGAACAAGAA GGGGGCTTGG ATGGACAGCA CCAAGGCCAC 

. R Y L V K T E S W I L R N P G Y A L V A 
2041 AAGGTATTTG GTAAAAACAG AATCATGGAT CTTGAGGAAC CCTGGATATG CCCTGGTGGC 

, A V I G W M L G S N T M Q R V V F V V L 
2101 AGCCGTCATT GGTTGGATGC TTGGGAGCAA CACCATGCAG AGAGTTGTGT TTGTCGTGCT 

=> WNV E 

. L L L V A P A Y S F NCL GMSN RDF 
2161 ATTGCTTTTG GTGGCCCCAG CTTACAGCTT CAACTGCCTT GGAATGAGCA ACAGAGACTT 

.LEG VSGA TWV DLV LEGD SCV 
2221 CTTGGAAGGA GTGTCTGGAG CAACATGGGT GGATTTGGTT CTCGAAGGCG ACAGCTGCGT 

.TIM SKDK PTI DVK MMNM EAA 
2281 GACTATCATG TCTAAGGACA AGCCTACCAT CGATGTGAAG ATGATGAATA TGGAGGCGGC 

.NLA EVRS YCY LAT VSDL STK 
2341 CAACCTGGCA GAGGTCCGCA GTTATTGCTA TTTGGCTACC GTCAGCGATC TCTCCACCAA 

.AAC PTMG EAH NDK R ADP AFV 
2401 AGCTGCGTGC CCGACCATGG GAGAAGCTCA CAATGACAAA CGTGCTGACC CAGCTTTTGT 

.CRQ GVVD RGW GNG CGLF GKG 
2461 GTGCAGACAA GGAGTGGTGG ACAGGGGCTG GGGCAACGGC TGCGGACTAT TTGGCAAAGG 

.SID TCAK FAC STK AIGR TIL 
2521 AAGCATTGAC ACATGCGCCA AATTTGCCTG CTCTACCAAG GCAATAGGAA GAACCATCTT 

.KEN IKYE VAI FVH G PTT VES 
2581 GAAAGAGAAT ATCAAGTACG AAGTGGCCAT CTTCGTGCAC GGACCAACTA CTGTGGAGTC 



.HGN ySTQ VGA TQA GRFS ITP 
2641 GCACGGAAAC TACTCCACAC AGGTTGGAGC CACTCAGGCA GGGAGATTCA GCATCACTCC 



.AAP SYTL KLG 
2701 TGCGGCGCCT TCATACACAC TAAAGCTTGG 

.PRS GIDT NAY 
2761 ACCACGGTCA GGGATTGACA CCAATGCATA 

.LVH REWF MDL 
2821 CTTGGTCCAT CGTGAGTGGT TCATGGACCT 

.VWR NRET LME 
2881 TGTGTGGAGG AACAGAGAGA CGTTAATGGA 

.VIA LGSQ EGA 
2941 TGTGATAGCA TTGGGCTCAC AAGAGGGAGC 

.VE F SSNT VKL 
3001 TGTGGAATTT TCAAGCAACA CTGTCAAGTT 

.MEK LQLK GTT 
3061 GATGGAAAAA TTGCAGTTGA AGGGAACAAC 

.LGT PADT GHG 
3121 TCTTGGGACT CCCGCAGACA CAGGTCACGG 

.TDG PCKV PIS 
3181 CACGGATGGA CCTTGCAAAG TTCCTATCTC 

.VGR LVTV NPF 
3241 AGTGGGCAGA TTGGTCACTG TCAACCCTTT 

.LIE LEPP FGD 
3301 CCTGATTGAA TTGGAACCAC CCTTTGGAGA 

.QIN HHWH KSG 
3361 ACAGATCAAT CACCATTGGC ACAAGTCTGG 

.LKG AQRL AAL 
3421 CCTCAAAGGA GCGCAGAGAC TAGCCGCTCT 

.GGV FTSV GKA 
3481 TGGAGGGGTG TTCACCTCAG TTGGGAAGGC 

.SLF GGMS WIT 
3541 CTCACTGTTC GGAGGCATGT CCTGGATAAC 

.MGI NARD RSI 
3601 GATGGGCATC AATGCTCGTG ATAGGTCCAT 

.LLF LSVN VHA 
3661 TCTGCTCTTC CTCTCCGTGA ACGTGCACGC 



EYG EVTV DCE 
AGAATATGGA GAGGTGACAG TGGACTGTGA 

YVM TVGT KTF 
CTACGTGATG ACTGTTGGAA CAAAGACGTT 

NLP WSSA GST 
CAACCTCCCT TGGAGCAGTG CTGGAAGTAC 

FEE PHAT KQS 
GTTTGAGGAA CCACACGCCA CGAAGCAGTC 

LHQ ALAG AIP 
TCTGCATCAA GCTTTGGCTG GAGCCATTCC 

TSG HLKC RVK 
GACGTCGGGT CATTTGAAGT GTAGAGTGAA 

YGV CSKA FKF 
CTATGGCGTC TGTTCAAAGG CTTTCAAGTT 

TVV LE LQ YTG 
CACTGTGGTG TTGGAATTGC AGTACACTGG 

SAA SLND LTP 
GTCAGCGGCT TCATTGAACG ACCTAACGCC 

VSV ATAN AKV 
TGTTTCAGTG GCCACGGCCA ACGCTAAGGT 

SYI VVGR GEQ 
CTCATACATA GTGGTGGGCA GAGGAGAACA 

SSI GKAF TTT 
AAGCAGCATT GGCAAAGCCT TTACAACCAC 

GDT AWDF GSV 
AGGAGACACA GCTTGGGACT TTGGATCAGT 

VHQ VFGG AFR 
TGTCCATCAA GTGTTCGGAG. GAGCATTCCG 

QG L LGAL LLW 
GCAAGGATTG CTGGGGGCTC TCCTGTTGTG 

ALT FLAV GGV 
AGCTCTCACG TTTCTCGCAG TTGGAGGAGT 



l^TAATTTTTA TCTAGAGTCG AGTTTTTATT 

















3721 


GACTAGTTAA 


=> F8L 
TCATAAGATA 


AATAATATAC 


AGCATTGTAA 


CCATCGTCAT 


CCGTTATACG 


3781 


GGGAATAATA 


TTACCATACA 


GTATTATTAA 


ATTTTCTTAC 


GAAGAATATA 


GATCGGTATT 


3841 


TATCGTTAGT 


TTATTTTACA 


TTTATTAATT AAACATGTCT ACTATTACCT GTTATGGAAA 


3901 


TGACAAATTT 


AGTTATATAA 






A TAATAATGA 


AATCAAATAA 


3961 


TTATGTAAAT 


GCTACTAGAT 




ACGAGGAAGA 


AAGTTTACGA 


ACTGGAAAAA 


4021 


ATTAAGTGAA 


TCTAAAATAT 


1 Mia 1 L< i An 


TGTAAAAAAA 


ATAAATGATA 


AAACTAACCA 


4081 


GTTAAAAACG 


GATATGATTA 




GGATATTGAT 


CATAAAGGAA 


GAGATACTTG 


4141 


CGGTTACTAT 


GTACACCAAG 




TTCTATATCA 


AATTGGATAT 


CTCCGTTATT 


4201 


CGCCGTTAAG 


GTAAATAAAA 




TTATATATGT 


AATGAATATG 


ATATACGACT 


4261 


TAGCGAAATG 


GAATCTGATA 




AATAGATGTA 


GTTGATAAAT 


TAGTAGGAGG 


4321 


ATACAATGAT 


GAAATAGCAG 


a fl 21 Th 21 T21 T21 


TTTGTTTAAT AAATTTATAG AAAAATATAT 


4381 


TGCTAACATA 


TCGTTATCAA 


r"vn^ 21 TT21 Tr* 


TAGTATATTA 


AATAATTTTA 


TAAATTTTAA 


4441 


TAAAAAATAC 


AATAACGACA 




TAAATCTTTA 


ATTCTTGATC 


TGAAAAACAC 


4501 


ATCTATAAAA 


CTAGATAAAA 




TAAAGATAAT AATGAATCGA 


ACGATGAAAA 


4561 


ATTGGAAACA 


GAAGTTGATA 


» r*rirp7i 21 TTTT 
/iurd/iril 111 


TTTCATCTAA 


ATAGTATTAT 


TTTATTGAAG 


4621 


TACGAAGTTT 


TACGTTAGAT 


AAA i AA i Afi/l 


GGTCGATTTT 


TACTTTGTTA 


AATATCAAAT 


4681 


ATGTCATTAT 


CTGATAAAGA 


Til P21 21 21 iH r'il 


CACGGTGATT 


ATCAACCATC 


TAACGAACAG 


4741 


ATATTACAAA 


AAATACGTCG 


GACTATGGAA 


AACGAAGCTG 


ATAGCCTCAA 


TAGAAGAAGC 


4801 


ATTAAAGAAA 


TTGTTGTAGA 


TGTTATGAAG 


AATTGGGATC 


ATCCTCTCAA 


CGAAGAAATA 


4861 


GATAAAGTTC 


TAAACTGGAA 


AAATGATACA 


TTAAACGATT 


TAGATCATCT 


AAATACAGAT 


4921 


GATAATATTA 


AGGAAATCAT 


ACAATGTCTG 


ATTAGAGAAT 


TTGCGTTTAA 


AAAGATCAAT 


4981 


TCTATTATGT 


ATAGTTATGC 


TATGGTAAAA 


CTCAATTCAG 


ATAACGAAAC 


ATTGAAAGAT 


5041 


AAAATTAAGG 


ATTATTTTAT 


AGAAACTATT 


CTTAAAGACA 


AACGTGGTTA 


TAAACAAAAG 
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Figure 5. Immunobl'ot analysis of the expression of WNV proteins from pox recombinants in 
CEFs 



Pellets Supematants 




Lane 1:ALVAC 
Lane 2: Fowlpox 
Lane 3: vCP2017 24h harvest 
Lane 4: vCP2017 48h harvest 
Lane 5: vCP2018 24h harvest 
Lane 6: vCP2018 48h harvest 
Lane 7: vFP2000 24h harvest 
Lane 8: vFP2000 48h harvest 



VCP2017 = ALVAC WNV prM-M-E 
VCP201 8 = ALVAC-2 WNV prM-M-E 
VFP2000 = Fowlpox WNV prM-M-E 



Figure 6. Immunoblbt analysis of the expression of WNV proteins from pox recombinants in 
BHK cells 



1 2 3 4 5 6 7 8 9 10 11 12 




Lane 1: VFP2000.3.2.1.2.1 pellet 
Lane 2: vCP2017.3.3 pellet 
Lane 3: vCP201 8.6.3.3.2 pellet 
Lane 4: mock infected BHK pellet 
Lane 5: pTriEx-WNV transfection pellet 
Lane 6: mock transfected BHK pellet 
Lane 7: vFP2000.3.2. 1.2.1 supt 
Lane 8: vCP201 7.3.3 supt 
Lane 9: vCP201 8.6.3.3.2 supt 
Lane 10: mock infected BHK supt 
Lane 11 : pTriEx-WNV transfection supt 
Lane 12: mock transfection supt 



vCP2017 = ALVAC WNV 
VCP201 8 - ALVAC-2 WNV 
VFP2000 = Fowlpox WNV 



Figure 7. Construction of pVR1012 WNV prM-M-E, pSL-5448-1-1. 
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Figure 8. Nucleotide sequence and translation of the WNV prM-M-E region In pSL-5448-1-1 . 
PVR1012WNV prM'M'E, 



PstI Kozak => WNV capsid leader 

MG S T G I A V M I G L I A S V - 

1 |CTGCAG| C CGC CACCATGGG A TCAACCGGAA TTGCAGTCAT GATTGGCCTG ATCGCCAGCG 

=> WNV prM 

..GAV TLS NFQG KVM MTV NATD- 
61 TAGGAGCAGT TACCCTCTCT AACTTCCAAG GGAAGGTGAT GATGACGGTA AATGCTACTG 

..VXD VIT IPTA AGK NLC IVRA- 
121 ACGTCACAGA TGTCATCACG ATTCCAACAG CTGCTGGAAA GAACCTATGC ATTGTCAGAG 

..MDV GYM CDDT ITY ECP VLSA- 
181 CAATGGATGT GGGATACATG TGCGATGATA CTATCACTTA TGAATGCCCA GTGCTGTCGG 

,.GND PED IDCW CTK SAV YVRY- 
241 CTGGTAATGA TCCAGAAGAC ATCGACTGTT GGTGCACAAA GTCAGCAGTC TACGTCAGGT 

=> WNV M 

..GRC TKT RHSR RSR RSL T V Q T - 
301 ATGGAAGATG CACCAAGACA CGCCACTCAA GACGCAGTCG GAGG TCACTG ACAGTGCAGA 

H G E STL A N K K G A W M D S T K A T * 
361 CACACGGAGA AAGCACTCTA GCGAACAAGA AGGGGGCTTG GATGGACAGC ACCAAGGCCA 

, , R y L V ^ K T E S W I L R N P G Y ALVA- 
421 CAAGGTATTT GGTAAAAACA GAATCATGGA TCTTGAGGAA CCCTGGATAT GCCCTGGTGG 

, . A V I G W M L G S N T M Q R V V F V V L - 
481 CAGCCGTCAT TGGTTGGATG CTTGGGAGCA ACACCATGCA GAGAGTTGTG TTTGTCGTGC 

=> WNV E 

..LLL V A P A Y S F NCL GMS NRDF- 
541 TATTGCTTTT GGTGGCCCCA GCTTACAGCT TCAACTGCCT TGGAATGAGC AACAGAGACT 

..LEG VSG ATWV DLV LEG DSCV- 
601 TCTTGGAAGG AGTGTCTGGA GCAACATGGG TGGATTTGGT TCTCGAAGGC GACAGCTGCG 

Clal 

..TIM SKD KPT I D VK MMN MEAA- 
661 TGACTATCAT GTCTAAGGAC AAGCCTACC |A TCGATp TGAA GATGATGAAT ATGGAGGCGG 

..NLA EVR SYCY LAT VSD LSTK- 
721 CCAACCTGGC AGAGGTCCGC AGTTATTGCT ATTTGGCTAC CGTCAGCGAT CTCTCCACCA 

..AAC PTM GEAH NDK RAD PAFV- 
781 AAGCTGCGTG CCCGACCATG GGAGAAGCTC ACAATGACAA ACGTGCTGAC CCAGCTTTTG 

..CRQ GVV DRGW GNG CGL FGKG- 
841 TGTGCAGACA AGGAGTGGTG GACT^GGGGCT GGGGCAACGG CTGCGGACTA TTTGGCAAAG 

..SID TCA KFAC STK AIG RTIL- 
901 GAAGCATTGA CACATGCGCC AAATTTGCCT GCTCTACCAA GGCAATAGGA AGAACCATCT 

..KEN IKY EVAI FVH GPT TVES- 
961 TGAAAGAGAA TATCAAGTAC GAAGTGGCCA TTTTTGTCCA TGGACCAACT ACTGTGGAGT 

..HGN YST QVGA TQA GRF SITP- 
1021 CGCACGGAAA CTACTCCACA CAGGTTGGAG CCACTCAGGC AGGGAGATTC AGCATCACTC 



..AAP SYT LKLG EYG EVT VDCE 
CTGCGGCGCC TTCATACACA CTAAAGCTTG GAGAATATGG AGAGGTGACA GTGGACTGTG 



1141 



1201 



1261 



1321 



X381 



1441 



1501 



1561 



1621 



1681 



..PRS GID TNAY YVM TVG TKTF 
AACCACGGTC AGGGATTGAC ACCAATGCAT ACTACGTGAT GACTGTTGGA ACAAAGACGT 

..LVH REW FMDL NLP WSS AGST 
TCTTGGTCCA TCGTGAGTGG TTCATGGACC TCAACCTCCC TTGGAGCAGT GCTGGAAGTA 

..VWR NRE TLME FEE PHA TKQS 
CTGTGTGGAG GAACAGAGAG ACGTTAATGG AGTTTGAGGA ACCACACGCC ACGAAGCAGT 

..VIA LGS QEGA LHQ ALA GAIP 
CTGTGATAGC ATTGGGCTCA CAAGAGGGAG CTCTGCATCA AGCTTTGGCT GGAGCCATTC 

••VEF SSN TVKL TSG HLK CRVK 
CTGTGGAATT TTCAAGCAAC ACTGTCAAGT TGACGTCGGG TCATTTGAAG TGTAGAGTGA 

•MEK LQL KGTT YGV CSK AFKF 
AGATGGAAAA ATTGCAGTTG AAGGGAACAA CCTATGGCGT CTGTTCAAAG GCTTTCAAGT 

..LG T PAD TGHG TVV LEL QYTG 
TTCTTGGGAC TCCCGCAGAC ACAGGTCACG GCACTGTGGT GTTGGAATTG CAGTACACTG 

..TDG PCK VPIS SAA SLN DLTP 
GCACGGATGG ACCTTGCAAA GTTCCTATCT CGTCAGCGGC TTC7VTTGAAC GACCTAACGC 

..VGR LVT VNPF VSV ATA NAKV 
CAGTGGGCAG ATTGGTCACT GTCAACCCTT TTGTTTCAGT GGCCACGGCC AACGCTAAGG 

• ' I E LEP PFGD SYI VVG RGEQ- 
TCCTGATTGA ATTGGAACCA CCCTTTGGAG ACTCATACAT AGTGGTGGGC AGAGGAGAAC 



1741 



1801 



..QIN HHW HKSG SSI GKA FTTT 
AACAGATCAA TCACCATTGG CACAAGTCTG GAAGCAGCAT TGGCAAAGCC TTTACAACCA 

..LKG AQR LAAL GDT AWD FGSV 
CCCTCAAAGG AGCGCAGAGA CTAGCCGCTC TAGGAGACAC AGCTTGGGAC TTTGGATCAG 



1861 



1921 



1981 



2041 



••GGV FTS VGKA VHQ VFG GAFR 
TTGGAGGGGT GTTCACCTCA GTTGGGAAGG CTGTCCATCA AGTGTTCGGA GGAGCATTCC 

..SLF GGM SWIT QGL LGA LLLW 

GCTCACTGTT CGGAGGCATG TCCTGGATAA CGCAAGGATT GCTGGGGGCT CTCCTGTTGT 

..MGI NAR DRSI ALT FLA VGGV 

GGATGGGCAT CAATGCTCGT GATAGGTCCA TAGCTCTCAC GTTTCTCGCA GTTGGAGGAG 

Xbal 

..LLF LSV NVHA 

TTCTGCTCTT CCTCTCCGTG AACGTGCACG CTTAATTTTT A |TCTAGfl| 
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FIGURE 9 

Sequence of pDS-2946-1-1 . pC5 H6p WNV prM-M-E. 

1 GCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCyVGCTGGCA 

6 1 CGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGCT 

12 1 CACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAAT 

=> C5R 

181 TGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACC7VTGATTACGAATTGCGGCC 

241 GCAATTCTGAATGTTAAATGTTATACTTTGGATGAAGCTATAAATATGCATTGGAAAAAT 

301 AArCCATTTAAAGAAAGGATTCAAATACTACAAAACCTAAGCGATAATAro 

361 CTTATrCTTAACGACGCrTTAAArArACACAAATAAACATAATTTTTO 

421 AATAACTAAAACATAAAAATAATAAAAGGAAATGTAATATCGTAATTATTTTACTCAGGA 

481 ATGGGGTTAAATATTTATATCACGTGTATATCTATACTGTTATCGTATACTCTTTACAAT 

54 1 TACrArrACGAArATGCAAGAGArAArAAGArTACGTATrrAAGAGAATCrTGTCATGAT 

601 AATTGGGTACGACATAGTGATAAATGCTATTTCGCATCGTTACATAAAGTCAGTTGGAAA 

661 GATGGATTTGACAGATGTAACTTAATAGGTGCAAAAATGTTAAATAACAGCATTCTATCG 

721 GAAGATAGGATACCAGTTATATTATACAAAAATCACTGGTTGGATAAAACAGATTCTGCA 

781 ArArTCGTAAAAGATGAAGATTACTGCGAATTTOrAAACTATOACAATAAAA^ 

841 ATCTCAACGACATCGTGTAATTCTTCCATGTTTTATGTATGTGTTTCAGATATTATGAGA 

901 TrACrATAAACTTrTrGTArACrTATArrCCGTAAACTATATTAArCArGAAGAAAATGA 

961 AAAAGrATAGAAGCrGTrCACGAGCGGrrGrTGAAAACAACAAAATTATACATrCAAGAr 

1021 GGCrrACATATACGrCTGTGAGGCrArCATGGATAATGACAATGCArCTCrAAATAGGr^ 

1081 TTTGGACAATGGATTCGACCCTAACACGGAATATGGTACTCTACAATCTCCTCTTGAAAT 

1141 GGCTGTAATGTTCAAGAATACCGAGGCTATAAAAATCTTGATGAGGTATGGAGCTAAACC 

1201 TGTAGTTACTGAATGCACAACTTCTTGTCTGCATGATGCGGTGTTGAGAGACGACTACAA 

1261 AATAGTGAAAGATCTGTTGAAGAATAACTATGTAAACAATGTTCTTTACAGCGGAGGCTT 

1321 TACTCCTTTGTGTTTGGCAGCTTACCTTAACAAAGTTAATTTGGTTAAACTTCTATTGGC 

1381 TCArrCGGCGGATGrAGATArrrCAAACACGGArCGGTTAACrCCTCrACArArAGCCGT 

1441 ATCAAATAAAAATTTAACAATGGITAAACTTCrArrGAACAAAGGTGCTGArACTGACTr 

1501 GCTGGATAACATGGGATGTACTCCTTTAATGATCGCTGTACAATCTGGAAATATTGAAAT 

00150857 



1561 ATGTAGCACACTACTTAAAAAAAATAAAATGTCCAGAACTGGGAAAAATTGATCTTGCCA 



1621 GCTGTAATTCATGGTAGAAAAGAAGTGCTCAGGCTACTTTTCAACAAAGGAGCAGATGTA 

1681 AACTACATCTTTGAAAGAAATGGAAAATCATATACTGTTTTGGAATTGATTAAAGAAAGT 

1741 TACTCTGAGACACAAAAGAGGTAGCTGAAGTGGTACTCTCAAAGGTACGTGACTAATTAG 

1801 CTATAAAAAGGATCCGGGTTAATTAATTAGTCATCAGGCAGGGCGAGAACGAGACTATCT 

H6p 

1861 GCTCGTTAATTAATTAGAG CTTCTTTATTCTATACTTAAAAAGTGAAAATAAATACAAAG 

1921 GTTCTTGAGGGTTGTGTTAAATTGAAAGCGAGAAATAATCATAAATTATTTCATTATCGC 

^ WNV capsid leader 
MTGIAVMIGLIA 
1981 GATATCCGTTAAGTTTGTATCGTAATGACCGGAA rrGCAGrCATGATTGGCCrGATCGCC 



=> WNV prM start 
SVGAVTLSNFQGKVMMTVNA 
2041 AGCGrAGGAGCGGTTACCCTCTCTAACTTCCAAGGGAAGGTGATGATGACGGTAAATGCT 

TDVTDVITI PTAAGKNLCIV 
2101 ACTGACGTCACAGATGTCATCACGATTCCAACAGCTGCTGGAAAGAACCTATGCATTGTC 

RAMDVGYMCDDTITYECPVL 
2161 AGAGCAATGGATGTGGGATACATGTGCGATGATACTATCACTTATGAATGCCCAGTGCTG 

SAGNDPEDIDCWCTKSAVYV 
2221 TCGGCTGGTAATGATCCAGAAGACATCGACTGTTGGTGCACAAAGTCAGCAGTCTACGTC 

=> WNV M start 

RYGRCTKTRHSRRSRRSLTV 
2281 AGGTATGGAAGATGCACCAAGACACGCCACTCAAGACGCAGTCGGAGGTCACrGACAGTO 

OTHGESTLANKKGAWMDSTK 

2341 CAGACACACGGAGAAAGCACTCTAGCGAACAAGAAGGGGGCTTGGATGGACAGCACCAAG 

ATRYLVKTESWILRNPGYAL 
2401 GCCACAAGGTATTTGGTAAAAACAGAATCATGGATCTTGAGGAACCCTGGATATGCCCTG 

VAAVIGWMLGSNTMQRVVFV 
2461 GTGGCAGCCGTCATTGGTTGGATGCTTGGGAGCAACACCATGCAGAGAGTTGTGTTTGTC 

WNV E start 

VLLLLVAPAYSFNCLGMSNR 
2521 GTGCTATTGCTTTTGGTGGCCCCAGCTTACAGCTTCAACTGCCTTGGAATGAGCAACAGA 

DFLEGVSGATWVDLVLEGDS 
2581 GACTTCTTGGAAGGAGTGTCTGGAGCAACATGGGTGGATTTGGTTCTCGAAGGCGACAGC 

CVTIMSKDKPTIDVKMMNME 
2641 TGCGTGACTATCATGTCTAAGGACAAGCCTACCATCGATGTGAAGATGATGAATATGGAG 
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AANLAEVRSYCYLA TVSDLS 
2701 GCGGCCAACCTGGCAGAGGTCCGCAGTTATTGCTATTTGGCTACCGTCAGCGATCTCTCC 

TKAACPTMGEAHN DK RADPA 
2761 ACCAAAGCTGCGTGCCCGACCATGGGAGAAGCTCACAATGACAAACGTGCTGACCCAGCT 

FVCRQ GVVDRGWGNGCGLFG 
2821 TTTGTGTGCAGACAAGGAGTGGTGGACAGGGGCTGGGGCAACGGCTGCGGACTATTTGGC 

KGSIDTCAKFACSTKAIGRT 
2881 AAAGGAAGCATTGACACATGCGCCAAATTTGCCTGCTCTACCAAGGCAATAGGAAGAACC 

mutated T5NT 

ILKENIKYEVAIFVHGPTTV 
2941 ATCTTGAAAGAGAATATCAAGTACGAAGTGGCC ATCTTCGTG CACGGACCAACTACTGTG 

ESHGNYSTQVGATQAGRFSI 
3001 GAGTCGCACGGAAACTACTCCACACAGGTTGGAGCCACTCAGGCAGGGAGATTCAGCATC 

TPAAPSYTLKLGEYGEVTVD 
3061 ACTCCTGCGGCGCCTTCATACACACTAAAGCTTGGAGAATATGGAGAGGTGACAGTGGAC 

CEPRSGIDTNAYYVMTVGTK 
3121 TGTGAACCACGGTCAGGGATTGACACCAATGCATACTACGTGATGACTGTTGGAACAAAG 

TFLVHREWFMDLNLPWSSAG 
3181 ACGTTCTTGGTCCATCGTGAGTGGTTCATGGACCTCAACCTCCCTTGGAGCAGTGCTGGA 

STV WRNRETLMEFEEPHATK 
3241 AGTACTGTGTGGAGGAACAGAGAGACGTTAATGGAGTTTGAGGAACCACACGCCACGAAG 

QSVIALGSQEGALHQALAGA 
3301 CAGTCTGTGATAGCATTGGGCTCACAAGAGGGAGCTCTGCATCAAGCTTTGGCTGGAGCC 

IPVEFSSNTVKLTSGHLKCR 
3361 ATTCCTGTGGAATTTTCAAGCAACACTGTCAAGTTGACGTCGGGTCATTTGAAGTGTAGA 

VKMEKLQLKGTTYGVCSKAF 
3421 GTGAAGATGGAAAAATTGCAGTTGAAGGGAACy^CCTATGGCGTCTGTTCAAAGGCTTTC 

KFLGTPADTGHGTVVLELQY 
3481 AAGTTTCTTGGGACTCCCGCAGACACAGGTCACGGCACTGTGGTGTTGGAATTGCAGTAC 

TGTDGPCKVPISSAASLNDL 
3541 ACTGGCACGGATGGACCTTGCAAAGTTCCTATCTCGTCAGCGGCTTCATTGAACGACCTA 

TPVGRLVTVNPFVSVATANA 
3601 ACGCCAGTGGGCAGATTGGTCACTGTCAACCCTTTTGTTTCAGTGGCCACGGCCAACGCT 

KVLIELEPPFGDSYIVVGRG 
3661 AAGGTGCTGATTGAATTGGAACCACCCTTTGGAGACTCATACATAGTGGTGGGCAGAGGA 

EQQINHHWHKSGSSIGKAFT 
3721 GAACAACAGATCAATCACCATTGGCACAAGTCTGGAAGCAGCATTGGCAAAGCCTTTACA 
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ttl'kgaqrlaalgdtawdfg 
3781 accaccctcaaaggagcgcagagactagccgctctaggagacacagcttgggactttgga 

svggvftsvgkavhqvfgga 
3841 tcagttggaggggtgttcacctcagttgggaaggctgtccatcaagtgttcggaggagca 

frslfggmswitqgllgall 
3901 ttccgctcactgttcggaggcatgtcctggataacgcaaggattgctgggggctctcctg 

LWMGINA RDRS lALTFLAVG 
3961 TTGTGGATGGGCATCAATGCTCGTGATAGGTCCATAGCTCTCACGTTTCTCGCAGTTGGA 

=> C5L 

GVLLFLSVNVHA* 
4021 GGAGTTCTGCTCTTCCTCTCCGTGAACGTGCACGCITAATTTTTATCT 

4081 GGGTTTTTATGACTAGTTAATCACGGCCGCTTATAAAGATCTAAAATGCATAATTTCTAA 

4141 ATAATGAAAAAAAGTACATCATGAGCAACGCGTTAGTATATTTTACAATGGAGATTAACG 

4201 CTCTATACCGTTCTATGTTTATTGATTCAGATGATGTTTTAGAAAAGAAAGTTATTGAAT 

4261 ATGAAAACTTTAATGAAGATGAAGATGACGACGATGATTATTGTTGTAAATCTGTTTTAG 

4321 ATGAAGAAGATGACGCGCTAAAGTATACTATGGTTACAAAGTATAAGTCTATACTACTAA 

4381 TGGCGACTTGTGCAAGAAGGTATAGTATAGTGAAAATGTTGTTAGATTATGATTATGAAA 

4441 AACCAAATAAATCAGATCCATATCTAAAGGTATCTCCTTTGCACATAATTTCATCTATTC 

4501 CrAGTTTAGJUirACCrGCAGCCAAGCTTGGCACTGGCCGTCGTTTTACAACGTCGTGACT 

4561 GGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCT 

4621 GGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATG 

4681 GCGAATGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCA 

4741 TATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACC 

4801 CGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGAC 

4861 AAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAAC 

4921 GCGCGAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAA 

4981 TGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTO 

5041 TATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGC 

5101 TTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTC 

5161 CCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAA 

5221 AAGATGCTGAAGATC AGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCG 

5281 GTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAG 
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5341 TTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCC 

5401 GCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTA 

5461 CGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTG 

5521 CGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTOT 

5581 ACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATAC 

5641 CAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTAT 

5701 TAACTGGCGAACTACTTACTCrAGCTTCCCGGCAACAATTAATAGACI^ 

5761 ATAAAGTTGC^GGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTA 

5821 AATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTA 

5881 AGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAA 

5941 ATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAG 

6001 TTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAA^ 

6061 TGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACT 

6121 GAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCG 

6181 TAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATC 

6241 AAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATA 

6301 CTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTA 

6361 CATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTC 

6421 TTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGG 

6481 GGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTAC 

6541 AGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGG 

6601 TAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGT 

6661 ATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCT 

6721 CGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGOVACGCGGCCTTTTTACGGTTCCTGG 

6781 CCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATA 

6841 ACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCA 

6901 GCGAGTCAGTGAGCGAGGAAGCGGAAGA 
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FIGURE 10 

Construction of pC5 H6p WNV prM-M-E donor plasmids with a truncated H6p +/- 
truncated WNV capsid leader sequence. 





Xhal(4067t ««PiUIU3345) 



pC5 H6p WNV prM-M.£ 




pC5 H6p (t) WNV prM-M-£ WNV-L (t) 



pC5 H6p (t) WNV prM-M*£ 



FIGURE 11 

ALVAC WNV constructs with truncated H6p -^h truncated leader sequence 
Explanation of terms: 

H6p (t) is the truncated H6p promoter deleted between Nru I and the 3 -end 

WNV-L (t) refers to the truncated WNV capsid leader, which is missing the initiating Met, 
so results in a shorter leader sequence 

H6p 5'-WNV sequence In actual vCP2017: 

^ H6p 

1861 GCTCGTTAATTAATTAGAGCTTCTTTATTCTATACTTAAAAAGTGAAAATAAATACAAAG 



Nru I 



1921 



GTTCTTGAGGGTTGTGTTAAATTGAAAGCGAGAAATAATCATAAATTATTTCATTA^CGCl 



^ WNV capsid leader 
MTGIAVMIGLIA 
1981 |GA|TATCCGTTAAGTTTGTATCGTAA TGACCGGAArrGCAGTCArGArrGGCCTgATCgCC 

=> WNV prM Start 
SVGAVTLSNFQGKVMMTVNA 
2041 AGCGTAGGAGCGGTTACCCTCTCTAACTTCCAAGGGAAGGTGATGATGACGGTAAATGCT 

TDVTDVITIPTAAGKNLCIV 
2101 ACTGACGTCACAGATGTCATCACGATTCCAACAGCTGCTGGAAAGAACCTATGCATTGTC 



PGR primers for H6p (t) WNV prM-M-E WNV-L (t): 

=> WNV capsid 
Nrul TGIAVMIGL 
11344 . SL 5 ' ATTATCGCGAACCGGAATTGCAGTCATGATTGGCCTG 



KPTIDVKM 
AAGCCTACCATCGATGTGAAGATG 
7 6 1 6 . SL 3 ' TTCGGATG GTAGCTA CACTTCTAC 

Cla I 



PGR primers for H6p (t) WNV prM-M-E: 

=> WNV capsid 
Nrul MTGIAVMIGL 
11345. SL 5' ATTATCGCGAATGACCGGAATTGCAGTCATGATTGGCCTG 



FIGURE 12A 

agtagttcgc ctgtgtgagc tgacaaactt agtagtgttt gtgaggatta acaacaatta 
60 

acacagtgcg agctgtttct tagcacgaag atctcg atg tct aag aaa cca qqa 
114 

Met Ser Lys Lys Pro Gly 
1 5 

ggg ccc ggc aag age egg get gte aat atg eta aaa cgc gga atg cce 

X62 

Gly Pro Gly Lys Ser Arg Ala Val Asn Met Leu Lys Arg Gly Met Pro 
10 15 20 

cgc gtg ttg tec ttg att gga ctg aag agg get atg ttg age ctg ate 
210 

Arg Val Leu Ser Leu He Gly Leu Lys Arg Ala Met Leu Ser Leu He 

25 30 35 

gac ggc aag ggg eca ata cga ttt gtg ttg get etc ttg gcg tte ttc 

258 

Asp Gly Lys Gly Pro He Arg Phe Val Leu Ala Leu Leu Ala Phe Phe 

40 45 50 

agg ttc aca gca att get ceg ace cga gca gtg ctg gat ega tgg aga 

306 

Arg Phe Thr Ala He Ala Pro Thr Arg Ala Val Leu Asp Arg Trp Arg 
55 60 65 70 

9gt gtg aae aaa caa aca gcg atg aaa cae ett ctg agt ttt aag aag 

Gly Val Asn Lys Gin Thr Ala Met Lys His Leu Leu Ser Phe Lys Lys 
75 80 85 

gaa eta ggg ace ttg ace agt get ate aat egg egg age tea aaa caa 
402 

Glu Leu Gly Thr Leu Thr Ser Ala He Asn Arg Arg Ser Ser Lys Gin 
90 95 100 

aag aaa aga gga gga aag ace gga att gca gte atg att ggc ctg ate 
450 

Lys Lys Arg Gly Gly Lys Thr Gly He Ala Val Met He Gly Leu He 
105 110 115 

gee age gta gga gca gtt aec etc tct aae ttc eaa ggg aag gtg atg 
498 

Ala Ser Val Gly Ala Val Thr Leu Ser Asn Phe Gin Gly Lys Val Met 
120 125 130 

atg aeg gta aat get act gac gte aca gat gte ate aeg att cca aca 
546 



001S068I 



FIGURE 12B 

Met Thr Val Asn Ala Thr Asp Val Thr Asp Val He Thr He Pro Thr 
135 140 145 150 

get get gga aag aac eta tge att gte aga gca atg gat gtg gga tac 
594 

Ala Ala Gly Lys Asn Leu Cys He Val Arg Ala Met Asp Val Gly Tyr 

155 - 160 165 

atg tge gat gat aet ate aet tat gaa tge eea gtg etg teg get ggt 
642 

Met Cys Asp Asp Thr He Thr Tyr Glu Cys Pro Val Leu Ser Ala Gly 
170 175 180 

aat gat eea gaa gac ate gae tgt tgg tge aea aag tea gca gte tae 
690 

Asn Asp Pro Glu Asp He Asp Cys Trp Cys Thr Lys Ser Ala Val Tyr 

185 190 195 

gte agg tat gga aga tge aee aag aea ege eae tea aga ege agt egg 
738 

Val Arg Tyr Gly Arg Cys Thr Lys Thr Arg His Ser Arg Arg Ser Arg 
200 205 210 

agg tea etg aea gtg eag aea eae gga gaa age aet eta gcg aae aag 
786 

Arg Ser Leu Thr Val Gin Thr His Gly Glu Ser Thr Leu Ala Asn Lys 

215 220 225 230 

aag ggg get tgg atg gae age aee aag gee aea agg tat ttg gta aaa 

834 

Lys Gly Ala Trp Met Asp Ser Thr Lys Ala Thr Arg Tyr Leu Val Lys 
235 240 245 

aea gaa tea tgg ate ttg agg aae eet gga tat gee etg gtg gca gee 
882 

Thr Glu Ser Tnp He Leu Arg Asn Pro Gly Tyr Ala Leu Val Ala Ala 
250 255 260 

gte att ggt tgg atg ett ggg age aae ace atg eag aga gtt gtg ttt 

930 

Val He Gly Trp Met Leu Gly Ser Asn Thr Met Gin Arg Val Val Phe 
265 270 275 

gte gtg eta ttg ett ttg gtg gee eea get tae age ttc aae tge ett 
978 

Val Val Leu Leu Leu Leu Val Ala Pro Ala Tyr Ser Phe Asn Cys Leu 
280 285 290 

gga atg age aae aga gac ttc ttg gaa gga gtg tct gga gca aea tgg 
1026 
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FIGURE 12C 

Gly Met Ser Asn Arg Asp Phe Leu Glu Gly Val Ser Gly Ala Thr Trp 
295 300 305 310 

gtg gat ttg gtt etc gaa ggc gac age tgc gtg act ate atg tct aag 
1074 

Val Asp Leu Val Leu Glu Gly Asp Ser Cys Val Thr lie Met Ser Lys 
315 320 325 

gae aag cct ace ate gat gtg aag atg atg aat atg gag gcg gcc aac 
1122 

Asp Lys Pro Thr lie Asp Val Lys Met Met Asn Met Glu Ala Ala Asn 
330 335 340 

etg gca gag gte egc agt tat tgc tat ttg get ace gtc age gat etc 
1170 

Leu Ala Glu Val Arg Ser Tyr Cys Tyr Leu Ala Thr Val Ser Asp Leu 
345 350 355 

tee ace aaa get gcg tgc eeg ace atg gga gaa .get cac aat gac aaa 
1218 

Ser Thr Lys Ala Ala Cys Pro Thr Met Gly Glu Ala His Asn Asp Lys 

360 365 370 

cgt get gae cea get ttt gtg tgc aga caa gga gtg gtg gae agg ggc 
1266 

Arg Ala Asp Pro Ala Phe Val Cys - Arg Gin Gly Val Val Asp Arg Gly 

375 380 385 390 

tgg ggc aac ggc tgc gga eta ttt ggc aaa gga age att gac aca tgc 

1314 

Trp Gly Asn Gly Cys Gly Leu Phe Gly Lys Gly Ser lie Asp Thr Cys 
395 400 405 

gee aaa ttt gee tgc tct ace aag gca ata gga aga ace ate ttg aaa 
1362 

Ala Lys Phe Ala Cys Ser Thr Lys Ala He Gly Arg Thr He Leu Lys 
410 415 420 

gag aat ate aag tac gaa gtg gee att ttt gtc cat gga cea act act 
1410 

Glu Asn He Lys Tyr Glu Val Ala He Phe Val His Gly Pro Thr Thr 

425 430 435 

gtg gag teg cac gga aac tac tee aca cag gtt gga gee act eag gca 
1458 

Val Glu Ser His Gly Asn Tyr Ser Thr Gin Val Gly Ala Thr Gin Ala 
440 445 450 

ggg aga ttc age ate act cct gcg gcg cct tea tac aca eta aag ett 
1506 
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FIGURE 12D 

Gly Arg Phe Ser lie Thr Pro Ala Ala Pro Ser Tyr Thr Leu Lys Leu 
455 460 465 470 

gga gaa tat gga gag gtg aca gtg gac tgt gaa cca egg tea ggg att 
1554 

Gly Glu Tyr Gly Glu Val Thr Val Asp Cys Glu Pro Arg Ser Gly lie 
475 480 485 

gac acc aat gca tac tac gtg atg act gtt gga aca aag acg ttc ttg 

1602 

Asp Thr Asn Ala Tyr Tyr Val Met Thr Val Gly Thr Lys Thr Phe Leu 
490 495 500 

gtc cat cgt gag tgg ttc atg gac etc aac etc ect tgg age agt get 
1650 

Val His Arg Glu Trp Phe Met Asp Leu Asn Leu Pro Trp Ser Ser Ala 
505 510 515 

gga agt act gtg tgg agg aac aga gag acg tta atg gag ttt gag gaa 
1698 

Gly Ser Thr Val Trp Arg Asn Arg Glu Thr Leu Met Glu Phe Glu Glu 
520 525 530 

cca cac gee acg aag cag tet gtg ata gca ttg gge tea caa gag gga 
1746 

Pro His Ala Thr Lys Gin Ser Val He Ala Leu Gly Ser Gin Glu Gly 
535 540 545 550 

get ctg cat caa get ttg get gga gee att ect gtg gaa ttt tea age 
1794 

Ala Leu His Gin Ala Leu Ala Gly Ala He Pro Val Glu Phe Ser Ser 
555 560 565 

aac act gtc aag ttg acg teg ggt cat ttg aag tgt aga gtg aag atg 
1842 

Asn Thr Val Lys Leu Thr Ser Gly His Leu Lys Cys Arg Val Lys Met 

570 575 580 

gaa aaa ttg cag ttg aag gga aca ace tat gge gtc tgt tea aag get 
1890 

Glu Lys Leu Gin Leu Lys Gly Thr Thr Tyr Gly Val Cys Ser Lys Ala 
585 590 595 

ttc aag ttt ett ggg act cec gca gac aca ggt cac gge act gtg gtg 
1938 

Phe Lys Phe Leu Gly Thr Pro Ala Asp Thr Gly His Gly Thr Val Val 
600 605 610 

ttg gaa ttg cag tac act gge acg gat gga ect tgc" aaa gtt ect ate 
1986 



FIGURE 12E 

Leu Glu Leu Gin Tyr Thr Gly Thr Asp Gly Pro Cys Lys Val Pro lie 
615 620 625 630 

teg tea gtg get tea ttg aae gae eta aeg eea gtg ggc aga ttg gte 
2034 

Ser Ser Val Ala Ser Leu Asn Asp Leu Thr Pro Val Gly Arg Leu Val 
635 640 645 

aet gte aac ect ttt gtt tea gtg gee aeg gee aae get aag gte etg 
2082 

Thr Val Asn Pro Phe Val Ser Val Ala Thr Ala Asn Ala Lys Val Leu 
650 655 660 

att gaa ttg gaa eea eee ttt gga gae tea tae ata gtg gtg gge aga 
2130 

lie Glu Leu Glu Pro Pro Phe Gly Asp Ser Tyr lie Val Val Gly Arg 
665 670 675 

gga gaa eaa eag ate aat eae eat tgg eae aag tet gga age age att 
2178 

Gly Glu Gin Gin lie Asn His His Trp His Lys Ser Gly Ser Ser lie 
680 685 690 

gge aaa gee ttt aea aee aee ete aaa gga geg eag aga eta gee get 
2226 

Gly Lys Ala Phe Thr Thr Thr Leu Lys Gly Ala Gin Arg Leu Ala Ala 
695 700 705 710 

eta gga gae aea get tgg gae ttt gga tea gtt gga ggg gtg tte ace 

2274 

Leu Gly Asp Thr Ala Trp Asp Phe Gly Ser Val Gly Gly Val Phe Thr 

715 720 725 

tea gtt ggg aag get gte eat eaa gtg tte gga gga gca tte ege tea 
2322 

Ser Val Gly Lys Ala Val His Gin Val Phe Gly Gly Ala Phe Arg Ser 
730 735 740 

etg tte gga gge atg tee tgg ata aeg eaa gga ttg etg ggg get ete 
2370 

Leu Phe Gly Gly Met Ser Trp lie Thr Gin Gly Leu Leu Gly Ala Leu 
745 750 755 

etg ttg tgg atg ggc ate aat get egt gat agg tee ata get ete aeg 
2418 

Leu Leu Trp Met Gly lie Asn Ala Arg Asp Arg Ser lie Ala Leu Thr 
760 765 770 

ttt etc gea gtt gga gga gtt etg etc tte ete tee gtg aae gtg eae 
2466 



FIGURE 12F 

Phe Leu Ala Val Gly Gly Val Leu Leu Phe Leu Ser Val Asn Val His 
775 780 785 790 

get gac act ggg tgt gcc ata gac ate age egg eaa gag etg aga tgt 
2514 

Ala Asp Thr Gly Cys Ala lie Asp He Ser Arg Gin Glu Leu Arg Cys 
795 800 805 

gga agt gga gtg ttc ata cae aat gat gtg gag get tgg atg gac egg 
2562 

Gly Ser Gly Val Phe He His Asn Asp Val Glu Ala Trp Met Asp Arg 
810 815 820 

tac aag tat tac cet gaa acg cea caa ggc eta gee aag ate att cag 
2610 

Tyr Lys Tyr Tyr Pro Glu Thr Pro Gin Gly Leu Ala Lys He lie Gin 
825 830 835 

aaa get eat aag gaa gga gtg tge ggt eta ega tea gtt tee aga etg 

2658 

Lys Ala His Lys Glu Gly Val Cys Gly Leu Arg Ser Val Ser Arg Leu 
840 845 850 

gag cat eaa atg tgg gaa gea gtg aag gae gag etg aae act ett ttg 

2706 

Glu His Gin Met Trp Glu Ala Val Lys Asp Glu Leu Asn Thr Leu Leu 

855 860 865 870 

aag gag aat ggt gtg gac ett agt gtc gtg gtt gag aaa cag gag gga 

2754 

Lys Glu Asn Gly Val Asp Leu Ser Val Val Val Glu Lys Gin Glu Gly 
875 880 885 

atg tac aag tea gea cet aaa ege etc ace gee ace acg gaa aaa ttg 
2802 

Met Tyr Lys Ser Ala Pro Lys Arg Leu Thr Ala Thr Thr Glu Lys Leu 
890 895 900 

gaa att ggc tgg aag gee tgg gga aag agt att tta ttt gea eca gaa 

2850 

Glu He Gly Trp Lys Ala Trp Gly Lys Ser He Leu Phe Ala Pro Glu 
905 910 915 

etc gee aae aae ace ttt gtg gtt gat ggt ecg gag ace aag gaa tgt 
2898 

Leu Ala Asn Asn Thr Phe Val Val Asp Gly Pro Glu Thr Lys Glu Cys 

920 925 930 

ecg act cag aat ege get tgg aat age tta gaa gtg gag gat ttt gga 
2946 
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FIGURE 12G 

Pro Thr Gin Asn Arg Ala Trp Asn Ser Leu Glu Val Glu Asp Phe Gly 
935 940 945 950 

ttt ggt etc acc age act egg atg ttc ctg aag gtc aga gag age aac 
2994 

Phe Gly Leu Thr Ser Thr Arg Met Phe Leu Lys Val Arg Glu Ser Asn 
955 960 965 

aca act gaa tgt gac teg aag ate att gga acg get gtc aag aac aac 
3042 

Thr Thr Glu Cys Asp Ser Lys lie lie Gly Thr Ala Val Lys Asn Asn 
970 975 980 

ttg gcg ate cac agt gac ctg tec tat tgg att gaa age agg etc aat 
3090 

Leu Ala lie His Ser Asp Leu Ser Tyr Trp lie Glu Ser Arg Leu Asn 
985 990 995 

gat acg tgg aag ett gaa agg gea gtt ctg ggt gaa gtc aaa tea 

3135 

Asp Thr Trp Lys Leu Glu Arg Ala Val Leu Gly Glu Val Lys Ser 
1000 1005 1010 

tgt acg tgg cet gag acg cat ace ttg tgg gge gat gga ate ett 
3180 

Cys Thr Trp Pro Glu Thr His Thr Leu Trp Gly Asp Gly lie Leu 
1015 1020 1025 

gag agt gac ttg ata ata eca gtc aca ctg gcg gga eca ega age 

3225 

Glu Ser Asp Leu He He Pro Val Thr Leu Ala Gly Pro Arg Ser 
1030 1035 1040 

aat cac aat egg aga cet ggg tac aag aca caa aac cag ggc eca 
3270 

Asn His Asn Arg Arg Pro Gly Tyr Lys Thr Gin Asn Gin Gly Pro 
1045 1050 1055 

tgg gac gaa gge egg gta gag att gac ttc gat tac tge eca gga 
3315 

Trp Asp Glu Gly Arg Val Glu He Asp Phe Asp Tyr Cys Pro Gly 
1060 1065 1070 

act acg gtc acc ctg agt gag age tge gga cac cgt gga cet gee ' 
3360 

Thr Thr Val Thr Leu Ser Glu Ser Cys Gly His Arg Gly Pro Ala 
1075 1080 1085 

act cge ace acc aca gag age gga aag ttg ata aca gat tgg tge 
3405 
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Thr Arg Thr Thr Thr 
1090 

tgc agg age tgc acc 
3450 

Cys Arg Ser Cys Thr 
1105 

ggc tgt tgg tat ggt 
3495 

Gly Cys Trp Tyr Gly 
1120 

aag acc etc gtg cag 
3540 

Lys Thr Leu Val Gin 
1135 

att gac cct ttt cag 
3585 

lie Asp Pro Phe Gin 
1150 

cag gag gtc ctt cgc 

3630 

Gin Glu Val Leu Arg 
1165 

get ata ctg att get 
3675 

Ala lie Leu He Ala 
1180 

tac act gat gtg tta 
3720 

Tyr Thr Asp Val Leu 
1195 

gea gaa tet aat teg 

3765 

Ala Glu Ser Asn Ser 
1210 

gcg acc ttc aag ata 
3810 

Ala Thr Phe Lys He 
1225 

aaa gcg aga tgg ace 
3855 



FIGURE 12H 
Glu Ser Gly Lys Leu 
1095 

tta cea cca ctg cgc 

Leu Pro Pro Leu Arg 
1110 

atg gag ate aga cca 

Met Glu He Arg Pro 
1125 

tea caa gtg aat get 

Ser Gin Val Asn Ala 
1140 

ttg ggc ctt ctg gtc 

Leu Gly Leu Leu Val 
1155 

aag agg tgg aca gee 

Lys Arg Trp Thr Ala 
1170 

ctg eta gtc ctg gtg 

Leu Leu Val Leu Val 
1185 

cgc tat gtc ate ttg 

Arg Tyr Val He Leu 
1200 

gga gga gac gtg gta 

Gly Gly Asp Val Val 
1215 

caa cca gtg ttt atg 

Gin Pro Val Phe Met 
1230 

aac cag gag aac att 



He Thr Asp Trp Cys 
1100 

tac caa act gac age 

Tyr Gin Thr Asp Ser 
1115 

cag aga cat gat gaa 

Gin Arg His Asp Glu 
1130 

tat aat get gat atg 

Tyr Asn Ala Asp Met 
1145 

gtg ttc ttg gee acc 

Val Phe Leu Ala Thr 
1160 

aag ate age atg cca 

Lys He Ser Met Pro 
1175 

ttt ggg ggc att act 

Phe Gly Gly He Thr 
1190 

gtg ggg gea get ttc 

Val Gly Ala Ala Phe 
1205 

cac ttg gcg etc atg 

His Leu Ala Leu Met 
1220 

9tg gea teg ttt etc 

Val Ala Ser Phe Leu 
1235 

ttg ttg atg ttg gcg 



Lys Ala Arg Trp Thr 
1240 

get gtt ttc ttt caa 
3900 

Ala Val Phe Phe Gin 
1255 

etc tgg gag ate cct 
3945 

Leu Trp Glu lie Pro 
1270 

atg ata ctg aga gee 
3990 

Met He Leu Arg Ala 
1285 

gtt eeg etg eta gee 
4035 

Val Pro Leu Leu Ala 
1300 

ctg gat gtg tae agg 
4080 

Leu Asp Val Tyr Arg 
1315 

ttg ate agg gag aag 
4125 

Leu He Arg Glu Lys 
1330 

agt etg eta tge ttg 
4170 

Ser Leu Leu Cys Leu 
1345 

atg ate ett get get 
4215 

Met He Leu Ala Ala 
1360 

cgc gga tgg cec gca 
4260 

Arg Gly Trp Pro Ala 
1375 

ttt gee ate gte gga 
4305 



FIGURE 121 
Asn Gin Glu Asn He 
1245 

atg get tat eae gat 

Met Ala Tyr His Asp 
1260 

gat gtg ttg aat tea 

Asp Val Leu Asn Ser 
1275 

ata aea ttc aca acg 

He Thr Phe Thr Thr 
1290 

ctg eta aea cec ggg 

Leu Leu Thr Pro Gly 
1305 

ata etg ctg ttg atg 

He Leu Leu Leu Met 
1320 

agg agt gca get gca 

Arg Ser Ala Ala Ala 
1335 

get eta gee tea aca 

Ala Leu Ala Ser Thr 
1350 

gga etg att gca tgt 

Gly Leu He Ala Cys 
1365 

act gaa gtg atg aca 

Thr Glu Val Met Thr 
1380 

ggg ctg gca gag ett 



Leu Leu Met Leu Ala 
1250 

gee cgc caa att ctg 

Ala Arg Gin He Leu 
1265 

ctg gcg gta get tgg 

Leu Ala Val Ala Trp 
1280 

aca tea aac gtg gtt 

Thr Ser Asn Val Val 
1295 

ctg aga tge ttg aat 

Leu Arg Cys Leu Asn 
1310 

gte gga ata ggc age 

Val Gly He Gly Ser 
1325 

aaa aag aaa gga gca 

Lys Lys Lys Gly Ala 
1340 

gga ett ttc aac cec 

Gly Leu Phe Asn Pro 
1355 

gat ecc aac egt aaa 

Asp Pro Asn Arg Lys 
1370 

get gte ggc eta atg 

Ala Val Gly Leu Met 
1385 

gae att gae tee atg 



Phe Ala He Val Gly 
1390 

gcc att cca atg act 
4350 

Ala He Pro Met Thr 
1405 

att tct ggg aaa tea 
4395 

He Ser Gly Lys Ser 
1420 

att tec tgg gaa agt 
4440 

He Ser Trp Glu Ser 
1435 

gtt gat gtg egg ctt 
4485 

Val Asp Val Arg Leu 
1450 

gat cca gga gca cet 
4530 

Asp Pro Gly Ala Pro 
1465 

etc gcg att agt gcg 

4575 

Leu Ala He Ser Ala 
1480 

gtt gga ttt tgg ata 
4620 

Val Gly Phe Trp He 
1495 

ttg tgg gac act ecc 
4665 

Leu Trp Asp Thr Pro 
1510 

ace ace gge gtc tac 
4710 

Thr Thr Gly Val Tyr 
1525 

tat caa gca gga gcg 
4755 



FIGURE 12J 
Gly Leu Ala Glu Leu 
1395 

ate gcg ggg^ etc atg 

He Ala Gly Leu Met 
1410 

aca gat atg tgg att 

Thr Asp Met Trp He 
1425 

gat gca gaa att aca 

Asp Ala Glu He Thr 
1440 

gat gat gat gga aac 

Asp Asp Asp Gly Asn 
1455 

tgg aag ata tgg atg 

Trp Lys He Trp Met 
1470 

tac ace ecc tgg gca 

Tyr Thr Pro Trp Ala 
1485 

act etc caa tac aca 

Thr Leu Gin Tyr Thr 
1500 

tea cca aag gag tac 

Ser Pro Lys Glu Tyr 
1515 

agg ate atg act cgt 

Arg He Met Thr Arg 
1530 

ggc gtg atg gtt gaa 



Asp He Asp Ser Met 
1400 

ttt get get ttc gtg 

Phe Ala Ala Phe Val 
1415 

gag aga acg gcg gac 

Glu Arg Thr Ala Asp 
.1430 

gge teg age gaa aga 

Gly Ser Ser Glu Arg 
1445 

ttc eag etc atg aat 

Phe Gin Leu Met Asn 
1460 

etc aga atg gtc tgt 

Leu Arg Met Val Cys 
1475 

ate ttg ecc tea gta 

He Leu Pro Ser Val 
1490 

aag aga gga gge gtg 

Lys Arg Gly Gly Val 
1505 

aaa aag ggg gac acg 

Lys Lys Gly Asp Thr 
1520 

ggg ctg etc gge agt 

Gly Leu Leu Gly Ser 
1535 

ggt gtt ttc eac ace 



■ FIGURE 12K 

-Tyr Gin Ala Gly Ala Gly Val Met Val Glu Gly Val Phe His Thr 
1540 1545 1550 

ctt tgg cat aca aca aaa gga gcc get ttg atg age gga gag ggc 
4800 

Leu Trp His Thr Thr Lys Gly Ala Ala Leu Met Ser Gly Glu Gly 
1555 1560 1565 

cgc ctg gac cca tac tgg ggc agt gtc aag gag gat cga ctt tgt 
4845 

Arg Leu Asp Pro Tyr Trp Gly Ser Val Lys Glu Asp Arg Leu Cys 
1570 1575 1580 

tac gga gga ccc tgg aaa ttg cag cac aag tgg aac ggg cag gat 
4890 

Tyr Gly Gly Pro Trp Lys Leu Gin His Lys Trp Asn Gly Gin Asp 
1585 1590 1595 

gag gtg cag atg att gtg gtg gaa cct ggc aag aac gtt aag aac 
4935 

Glu Val Gin Met He Val Val Glu Pro Gly Lys Asn Val Lys Asn 
1600 1605 1610 

gtc cag acg aaa cca ggg gtg ttc aaa aca cct gaa gga gaa ate 
4980 

Val Gin Thr Lys Pro Gly Val Phe Lys Thr Pro Glu Gly Glu He 
1615 1620 1625 

ggg gcc gtg act ttg gac ttc ccc act gga aca tea ggc tea cca 
5025 

Gly Ala Val Thr Leu Asp Phe Pro Thr Gly Thr Ser Gly Ser Pro 
1630 1635 1640 

ata gtg gac aaa aac ggt gat gtg att ggg ctt tat ggc aat gqa 

5070 

He Val Asp Lys Asn Gly Asp Val He Gly Leu Tyr Gly Asn Gly 
1645 1650 1655 

Stc^ata atg ccc aac ggc tea tac ata age gcg ata gtg cag ggt 

Val He Met Pro Asn Gly Ser Tyr He Ser Ala He Val Gin Gly 
1660 1665 1670 

gaa agg atg gat gag cca ate cca gcc gga ttc gaa cct gaq atq 
5160 

Glu Arg Met Asp Glu Pro He Pro Ala Gly Phe Glu Pro Glu Met 
1675 1680 1685 

ctg agg aaa aaa cag ate act gta ctg gat etc eat ccc qqe qec 
5205 



001S0681 



Leu Arg Lys Lys Gin 
1690 

ggt aaa aca agg agg 

5250 

Gly Lys Thr Arg Arg 
1705 

aac aga aga ctg aga 
5295 

Asn Arg Arg Leu Arg 
1720 

get get gag atg get 
5340 

Ala Ala Glu Met Ala 
1735 

cag aea tec gea gtg 
5385 

Gin Thr Ser Ala Val 
1750 

gat gte atg tgt eat 
5430 

Asp Val Met Cys His 
1765 

eae agg gtg ceg aae 

5475 

His Arg Val Pro Asn 
1780 

tte aec gae cea get 
5520 

Phe Thr Asp Pro Ala 
1795 

aag gte gag eta ggg 
5565 

Lys Val Glu Leu Gly 
1810 

eca eea gge aet tea 
5610 

Pro Pro Gly Thr Ser 
1825 

tee gae tta cag aet 
5655 



FIGURE 12L 
lie Thr Val Leu Asp 
1695 

att ctg cea cag ate 

lie Leu Pro Gin lie 
1710 

aca gee gtg eta gca 

Thr Ala Val Leu Ala 
1725 

gaa gca ctg aga gga 

Glu Ala Leu Arg Gly 
1740 

ecc aga gaa cat aat 

Pro Arg Glu His Asn 
1755 

get ace etc ace eae 

Ala Thr Leu Thr His 
1770 

tac aac ctg tte gtg 

Tyr Asn Leu Phe Val 
1785 

age att gca gca aga 

Ser lie Ala Ala Arg 
1800 

gag geg gcg gea ata 

Glu Ala Ala Ala lie 
1815 

gat eca tte eea gag 

Asp Pro Phe Pro Glu 
1830 

gag ate ccg gat ega 



Leu His Pro Gly Ala 
1700 

ate aaa gag gee ata 

He Lys Glu Ala He 
1715 

eca ace agg gtt gtg 

Pro Thr Arg Val Val 
1730 

ctg ecc ate egg tac 

Leu Pro He Arg Tyr 
1745 

gga aat gag att gtt 

Gly Asn Glu He Val 
1760 

agg ctg atg tet ect 

Arg Leu Met Ser Pro 
1775 

atg gat gag get cat 

Met Asp Glu Ala His 
1790 

ggt tac att tee aca 

Gly Tyr He Ser Thr 
1805 

tte atg aea gee aec 

Phe Met Thr Ala Thr 
1820 

tee aat tea eca att 

Ser Asn Ser Pro He 
1835 

get tgg aac tet gga 



FIGURE 12M 

Ser Asp Leu Gin Thr Glu He Pro Asp Arg Ala Trp Asn Ser Gly 
1840 1845 1850 

tac gaa tgg ate aca gaa tac acc ggg aag acg gtt tgg ttt gtg 
5700 

Tyr Glu Trp He Thr Glu Tyr Thr Gly Lys Thr Val Trp Phe Val 
1855 1860 1865 

cct agt gtc aag atg ggg aat gag att gcc ctt tgc eta caa cgt 
5745 

Pro Ser Val Lys Met Gly Asn Glu He Ala Leu Cys Leu Gin Arg 
1870 1875 1880 

get gga aag aaa gta gte eaa ttg aac aga aag teg tae gag acg 
5790 

Ala Gly Lys Lys Val Val Gin Leu Asn Arg Lys Ser Tyr Glu Thr 
1885 1890 1895 

gag tae cea aaa tgt aag aae gat gat tgg gac ttt gtt ate aca 
5835 ■ 

Glu Tyr Pro Lys Cys Lys Asn Asp Asp Trp Asp Phe Val He Thr 
1900 1905 1910 

aca gac at a tet gaa atg ggg get aae tte aag geg age agg gtg 
5880 

Thr Asp He Ser Glu Met Gly Ala Asn Phe Lys Ala Ser Arg Val 
1915 1920 1925 

att gac age egg aag agt gtg aaa cea ace ate ata aca gaa gga 

5925 

He Asp Ser Arg Lys Ser Val Lys Pro Thr He He Thr Glu Gly 
1930 1935 1940 

gaa ggg aga gtg ate ctg gga gaa cea tet gea gtg aca gca get 
5970 

Glu Gly Arg Val He Leu Gly Glu Pro Ser Ala Val Thr Ala Ala 
1945 1950 1955 

agt gcc gee eag aga cgt gga cgt ate ggt aga aat ccg teg caa 

6015 

Ser Ala Ala Gin Arg Arg Gly Arg He Gly Arg Asn Pro Ser Gin 
I960 1965 1970 

gtt ggt gat gag tac tgt tat ggg ggg cac acg aat gaa gac gac 
6060 

Val Gly Asp Glu Tyr Cys Tyr Gly Gly His Thr Asn Glu Asp Asp 
1975 1980 1985 

teg aac ttc gee cat tgg act gag gea ega ate atg ctg gac aac 
6105 



Ser Asn Phe Ala His 
1990 

ate aac atg cca aac 
6150 

lie Asn Met Pro Asn 
2005 

cgt gag aag gta tat 
6195 

Arg Glu Lys Val Tyr 
2020 

gaa gag aga aaa aac 
6240 

Glu Glu Arg Lys Asn 
2035 

cca gtt tgg ctg get 
6285 

Pro Val Trp Leu Ala 
2050 

cac gac egg agg tgg 
6330 

His Asp Arg Arg Trp 
2065 

tta gaa gac aac aac 

6375 

Leu Glu Asp Asn Asn 
2080 

agg aag att ctg agg 
6420 

Arg Lys lie Leu Arg 
2095 

gat cac cag gca eta 
6465 

Asp His Gin Ala Leu 
2110 

cgt tct cag ata ggg 
6510 

Arg Ser Gin lie Gly 
2125 

cac tte atg ggg aag 
6555 



FIGURE 12N 
Trp Thr Glu Ala Arg 
1995 

gga ctg ate get eaa 

Gly Leu lie Ala Gin 
2010 

ace atg gat ggg gaa 

Thr Met Asp Gly Glu 
2025 

ttt ctg gaa ctg ttg 

Phe Leu Glu Leu Leu 
2040 

tac aag gtt gca gcg 

Tyr Lys Val Ala Ala 
2055 

tgc ttt gat ggt cet 

Cys Phe Asp Gly Pro 
2070 

gaa gtg gaa gtc ate 

Glu Val Glu Val He 
2085 

ecg cgc tgg att gac 

Pro Arg Trp He Asp 
2100 

aag gcg tte aag gac 

Lys Ala Phe Lys Asp 
2115 

etc att gag gtt ctg 

Leu He Glu Val Leu 
2130 

aca tgg gaa gca ett 



He Met Leu Asp Asn 
2000 

tte tac caa cca gag 

Phe Tyr Gin Pro Glu 
2015 

tac egg etc aga gga 

Tyr Arg Leu Arg Gly 
2030 

agg act gca gat ctg 

Arg Thr Ala Asp Leu 
2045 

get gga gtg tea tac 

Ala Gly Val Ser Tyr 
2060 

agg aca aac aca att 

Arg Thr Asn Thr He 
2075 

aeg aag ett ggt gaa 

Thr Lys Leu Gly Glu 
2090 

gee agg gtg tac teg 

Ala Arg Val Tyr Ser 
2105 

tte gee teg gga aaa 

Phe Ala Ser Gly Lys 
2120 

gga aag atg cet gag 

Gly Lys Met Pro Glu 
2135 

gac acc atg tac gtt 
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His Phe Met Gly Lys 
2140 

gtg gcc act gca gag 
6600 

Val Ala Thr Ala Glu 
2155 

gag gaa ctg cca gat 
6645 

Glu Glu Leu Pro Asp 
2170 

ttg agt gtg atg acc 
6690 

Leu Ser Val Met Thr 
2185 

aag ggc att gga aag 
6735 

Lys Gly lie Gly Lys 
2200 

gcg acc ttt ttc tgt 
6780 

Ala Thr Phe Phe Cys 
2215 

gcc gga atg ttg ctg 

6825 

Ala Gly Met Leu Leu 
2230 

cct gag cca gag aag 
6870 

Pro Glu Pro Glu Lys 
2245 

gtg ttc ctg att tgt 
6915 

Val Phe Leu lie Cys 
2260 

aac gag atg ggt tgg 
6960 

Asn Glu Met Gly Trp 
2275 

ttg ttt ggg caa aga 
7005 



FIGURE 120 
Thr Trp Glu Ala Leu 
2145 

aaa gga gga aga get 

Lys Gly Gly Arg Ala 
2160 

get ctt cag aca att 

Ala Leu Gin Thr lie 
2175 

atg gga gta ttc ttc 

Met Gly Val Phe Phe 
2190 

ata ggt ttg gga ggc 

lie Gly Leu Gly Gly 
2205 

tgg atg get gaa gtt 

Trp Met Ala Glu Val 
2220 

etc tec ctt etc ttg 

Leu Ser Leu Leu Leu 
2235 

caa cgt teg cag aca 

Gin Arg Ser Gin Thr 
2250 

gtc atg acc ctt gtg 

Val Met Thr Leu Val 
2265 

eta gat aag acc aag 

Leu Asp Lys Thr Lys 
2280 

att gag gtc aag gag 



Asp Thr Met Tyr Val 
2150 

cae aga atg gcc ctg 

His Arg Met Ala Leu 
2165 

gcc ttg att gcc tta 

Ala Leu lie Ala Leu 
2180 

etc etc atg cag egg 

Leu Leu Met Gin Arg 
2195 

get gtc ttg gga gtc 

Ala Val Leu Gly Val 
2210 

cca gga aeg aag ate 

Pro Gly Thr Lys lie 
2225 

atg att gtg eta att 

Met He Val Leu He 
2240 

gae aac cag eta gee 

Asp Asn Gin Leu Ala 
2255 

age gca gtg gca gcc 

Ser Ala Val Ala Ala 
2270 

agt gac ata age agt 

Ser Asp He Ser Ser 
2285 

aat ttc age atg gga 
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FIGURE 12P 

Leu Phe Gly Gin Arg lie Glu Val Lys Glu Asn Phe Ser Met Gly 
2290 2295 2300 

7050^*^*^ ^^'^ '^^^ 

Glu Phe Leu Leu Asp Leu Arg Pro Ala Thr Ala Trp Ser Leu Tyr 
2305 2310 2315 

get gtg aca aca gcg gtc etc act cca ctg eta aag cat ttg ate 

7095 ass 

Ala Val Thr Thr Ala Val Leu Thr Pro Leu Leu Lys His Leu He 
2320 2325 2330 

acg tea gat tae ate aae ace tea ttg ace tea ata aac qtt eaa 
7140 ^ ^ 

Thr Ser Asp Tyr He Asn Thr Ser Leu Thr Ser He Asn Val Gin 
2335 2340 2345 

gea agt gca eta tte aca etc gcg cga gge ttc ccc ttc gtc gat 
7185 

Ala Ser Ala Leu Phe Thr Leu Ala Arg Gly Phe Pro Phe Val Asp 
2350 2355 2360 

7230^^^ 9*^9 9Ct etc ctg eta gea gee gga tgc tgg gga caa 

Val Gly Val Ser Ala Leu Leu Leu Ala Ala Gly Cys Trp Gly Gin 
2365 2370 2375 

gtc ace etc ace gtt acg gta aca geg gea aca etc ett ttt tgc 
7275 

Val Thr Leu Thr Val Thr Val Thr Ala Ala Thr Leu Leu Phe Cys 
2380 2385 2390 

73 2 o'^^'^ 

His Tyr Ala Tyr Met Val Pro Gly Trp Gin Ala Glu Ala Met Arg 
2395 2400 2405 

tea gee cag egg egg aca gcg gee gga ate atg aag aac get gta 

7365 

Ser Ala Gin Arg Arg Thr Ala Ala Gly He Met Lys Asn Ala Val 
2410 2415 2420 

gtg gat gge ate gtg gee aeg gae gtc cca gaa tta gag egc ace 
7410 

Val Asp Gly He Val Ala Thr Asp Val Pro Glu Leu Glu Arg Thr 
2425 2430 2435 

7455°°*^ ^te atg cag aag aaa gtt gga cag ate atg ctg ate ttg 
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Thr Pro lie Met Gin 
2440 

gtg tct eta get gca 
7500 

Val Ser Leu Ala Ala 
2455 

cga gaa gcc gga att 
7545 

Arg Glu Ala Gly He 
2470 

gag aat gga gca age 
7590 

Glu Asn Gly Ala Ser 
2485 

etc tgc eac ate atg 

7635 

Leu Cys His He Met 
2500 

aea tgg aea etc ata 

7680 

Thr Trp Thr Leu He 
2515 

99t ggg gca aaa gga 
7725 

Gly Gly Ala Lys Gly 
2530 

etc aae eag atg aea 

7770 

Leu Asn Gin Met Thr 
2545 

gee ate ate gaa gte 
7815 

Ala He He Glu Val 
2560 

gaa gge aat gte act 
7860 

Glu Gly Asn Val Thr 
2575 

aaa ctg aga tgg etg 
7905 



FIGURE 12Q 
Lys Lys Val Gly Gin 
2445 

gta gta gtg aae ecg 

Val Val Val Asn Pro 
• 2460 

ttg ate acg gee gca 

Leu He Thr Ala Ala 
2475 

tct gtt tgg aae gca 

Ser Val Trp Asn Ala 
. 2490 

cgt ggg ggt tgg ttg 

Arg Gly Gly Trp Leu 
2505 

aag aae atg gaa aaa 

Lys Asn Met Glu Lys 
2520 

ege ace ttg gga gag 

Arg Thr Leu Gly Glu 
2535 

aaa gaa gag tte act 

Lys Glu Glu Phe Thr 
2550 

gat ege tea geg gca 

Asp Arg Ser Ala Ala 
2565 

99^ ggg eat eea gte 

Gly Gly His Pro Val 
2580 

gte gaa egg agg ttt 



He Met Leu He Leu 
2450 

tct gtg aag aea gta 

Ser Val Lys Thr Val 
2465 

geg gtg acg ctt tgg 

Ala Val Thr Leu Trp 
2480 

aea act gcc ate gga 

Thr Thr Ala He Gly 
2495 

tea tgt eta tec ata 

Ser Cys Leu Ser He 
2510 

eea gga eta aaa aga 

Pro Gly Leu Lys Arg 
2525 

gtt tgg aaa gaa aga 

Val Trp Lys Glu Arg 
2540 

agg tac ege aaa gag 

Arg Tyr Arg Lys Glu 
2555 

aaa eac gee agg aaa 

Lys His Ala Arg Lys 
2570 

tct agg gge aea gca 

Ser Arg Gly Thr Ala 
2585 

etc gaa ecg gte gga 
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FIGURE 12R 

Lys Leu Arg Trp Leu Val Glu Arg Arg Phe Leu Glu Pro Val Gly 
2590 2595 2600 

aaa gtg att gac ctt gga tgt gga aga ggc ggt tgg tgt tac tat 
7950 

Lys Val He Asp Leu Gly Cys Gly Arg Gly Gly Trp Cys Tyr Tyr 
2605 2610 2615 

atg gca acc caa aaa aga gtc caa gaa gtc aga ggg tac aca aag 
7995 

Met Ala Thr Gin Lys Arg Val Gin Glu Val Arg Gly Tyr Thr Lys 
2620 2625 2630 

ggc ggt ccc gga cat gaa gag ccc caa eta gtg caa agt tat gga 
8040 

Gly Gly Pro Gly His Glu Glu Pro Gin Leu Val Gin Ser Tyr Gly 
" 2635 2640 2645 

tgg aac att gtc acc atg aag agt gga gtg gat gtg ttc tac aga 

8085 

Trp Asn He Val Thr Met Lys Ser Gly Val Asp Val Phe Tyr Arg 
2650 2655 2660 

cct tct gag tgt tgt gac acc etc ctt tgt gac ate gga gag tec 
8130 

Pro Ser Glu Cys Cys Asp Thr Leu Leu Cys Asp He Gly Glu Ser 
2665 2670 2675 

teg tea agt get gag gtt gaa gag cat agg acg att egg gtc ctt 

8175 

Ser Ser Ser Ala Glu Val Glu Glu His Arg Thr He Arg Val Leu 
2680 2685 2690 

gaa atg gtt gag gac tgg ctg cac cga ggg cca agg gaa ttt tgc 
8220 

Glu Met Val Glu Asp Trp Leu His Arg Gly Pro Arg Glu Phe Cys 
2695 2700 2705 

gtg aag gtg etc tgc ccc tac atg ceg aaa gtc ata gag aag atg 

8265 

Val Lys Val Leu Cys Pro Tyr Met Pro Lys Val He Glu Lys Met 

2710 . 2715 2720 

gag ctg etc caa egc egg tat ggg ggg gga ctg gtc aga aac cca 
8310 

Glu Leu Leu Gin Arg Arg Tyr Gly Gly Gly Leu Val Arg Asn Pro 
2725 2730 2735 

etc tea egg aat tec acg cac gag atg tat tgg gtg agt cga get 
8355 
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FIGURE 12S 

Leu Ser Arg Asn Ser Thr His Glu Met Tyr Trp Val Ser Arq Ala 
2740 2745 2750 

8400^^^ ^^'^ ^^"^ ""^^ 

Ser Gly Asn Val Val His Ser Val Asn Met Thr Ser Gin Val Leu 
2755 2760 2765 

cta^gga aga atg gaa aaa agg acc tgg aag gga ccc caa tac gag 

Leu Gly Arg Met Glu Lys Arg Thr Trp Lys Gly Pro Gin Tyr Glu 
2770 2775 2780 

1490^^'' ^^"^ ^""^ 

Glu Asp Val Asn Leu Gly Ser Gly Thr Arg Ala Val Gly Lys Pro 
2785 2790 2795 

ctg etc aac tea gac acc agt aaa ate aag aac agg att gaa cga 

o535 

Leu Leu Asn Ser Asp Thr Ser Lys He Lys Asn Arg He Glu Arq 
2800 2805 2810 

8580^^^ "^^^ ^^"^ ^"^^ ^^"^ 

Leu Arg Arg Glu Tyr Ser Ser Thr Trp His His Asp Glu Asn His 
2815 2820- 2825 

cca^tat aga acc tgg aac tat cac ggc agt tat gat gtg aag ccc 

Pro Tyr Arg Thr Trp Asn Tyr His Gly Ser Tyr Asp Val Lys Pro 
2830 2835 2840 

aca ggc tec gee agt teg ctg gte aat gga gtg gte agg etc etc 

Thr Gly Ser Ala Ser Ser Leu Val Asn Gly Val Val Arg Leu Leu 
2845 2850 2855 

tca^aaa eca tgg gac ace ate aeg aat gtt acc acc atg gcc atg 

Ser Lys Pro Trp Asp Thr He Thr Asn Val Thr Thr Met Ala Met 
2860 2865 2870 

act gac act act ccc ttc ggg cag eag ega gtg tte aaa gag aag 

Thr Asp Thr Thr Pro Phe Gly Gin Gin Arg Val Phe Lys Glu Lys 
2875 2880 2885 

8805^^'' ^""^ ^""^ """^ ""^^ ''^^ 9*^9 
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FIGURE 12T 

Val Asp Thr Lys Ala Pro Glu Pro Pro Glu Gly Val Lys Tyr Val 
2890 2895 2900 

etc aat gag acc acc aac tgg ttg tgg gcg ttt ttg gcc aga gaa 

8850 

Leu Asn Glu Thr Thr Asn Trp Leu Trp Ala Phe Leu Ala Arq Glu 
2905 - 2910 2915 

8895^^*" ^^"^ ^""^ ^^"^ '^^^ 

Lys Arg Pro Arg Met Cys Ser Arg Glu Glu Phe He Arg Lys Val 
2920 2925 2930 

aac age aat gca get ttg ggt gee atg ttt gaa gag cag aat caa 

8 940 

Asn Ser Asn Ala Ala Leu Gly Ala Met Phe Glu Glu Gin Asn Gin 
2935 2940 2945 

tgg^agg age gcc aga gaa gca gtt gaa gat cca aaa ttt tgg gag 

Trp Arg Ser Ala Arg Glu Ala Val Glu Asp Pro Lys Phe Trp Glu 
2950 2955 2960 

9030^*^^ 

Met Val Asp Glu Glu Arg Glu Ala His Leu Arg Gly Glu Cys His 
2965 2970 2975 

act tgc att tac aac atg atg gga aag aga gag aaa aaa ccc gga 

Thr Cys He Tyr Asn Met Met Gly Lys Arg Glu Lys Lys Pro Glv 
2980 2985 2990 

9120^*^^ 99^ 9cc aag gga age aga gcc att tgg ttc atg tgg 

Glu Phe Gly Lys Ala Lys Gly Ser Arg Ala He Trp Phe Met Trp 
2995 3000 3005 

etc gga get cgc ttt ctg gag ttc gag get ctg ggt ttt etc aat 

9165 

Leu Gly Ala Arg Phe Leu Glu Phe Glu Ala Leu Gly Phe Leu Asn 
3010 3015 3020 

gaa gae eae tgg ett gga aga aag aac tea gga gga ggt gtc gag 
92 1 0 

Glu Asp His Trp Leu Gly Arg Lys Asn Ser Gly Gly Gly Val Glu 
3025 3030 3035 

ggc ttg ggc etc caa aaa ctg ggt tac ate ctg cgt gaa gtt ggc 
9255 
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FIGURE 12U 

Gly Leu Gly Leu Gin Lys Leu Gly Tyr He Leu Arg Glu Val Gly 
3040 3045 3050 

acc egg cct ggg ggc aag ate tat get gat gac aca get ggc tgg 

9300 

Thr Arg Pro Gly Gly Lys He Tyr Ala Asp Asp Thr Ala Gly Trp 
3055 3060 3065 

gac^aee egc ate aeg aga get gae ttg gaa aat gaa get aag gtg 

Asp Thr Arg He Thr Arg Ala Asp Leu Glu Asn Glu Ala Lys Val 
3070 3075 3080 

ctt^gag etg ctt gat ggg gaa eat egg egt ett gee agg gcc ate 

Leu Glu Leu Leu Asp Gly Glu His Arg Arg Leu Ala Arg Ala He 
3085 3090 3095 

att^gag etc aee tat egt cae aaa gtt gtg aaa gtg atg cgc ccg 

He Glu Leu Thr Tyr Arg His Lys Val Val Lys Val Met Arg Pro 
3100 3105 3110 

get get gat gga aga aee gtc atg gat gtt ate tec aga gaa qat 

9480 3 

Ala Ala Asp Gly Arg Thr Val Met Asp Val He Ser Arg Glu Asp 
3115 3120 3125 

eag agg ggg agt gga caa gtt gtc ace tac gee eta aac act tte 

9525 

Gin Arg Gly Ser Gly Gin Val Val Thr Tyr Ala Leu Asn Thr Phe 
3130 3135 33^4Q 

acc^aac etg gee gtc eag etg gtg agg atg atg gaa ggg gaa gga 

Thr Asn Leu Ala Val Gin Leu Val Arg Met Met Glu Gly Glu Gly 
3145 3150 3155 

feis^*^*^ 

Val He Gly Pro Asp Asp Val Glu Lys Leu Thr Lys Gly Lys Gly 
3160 3165 3170 

ccc aaa gtc agg ace tgg etg ttt gag aat ggg gaa gaa aga etc 

Pro Lys Val Arg Thr Trp Leu Phe Glu Asn Gly Glu Glu Arg Leu 
3175 3180 3185 

agc^cgc atg get gtc agt gga gat gac tgt gtg gta aag cec etg 
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FIGURE 12V 

Ser Arg Met Ala Val Ser Gly Asp Asp Cys Val Val Lys Pro Leu 
3190 • 3195 3200 

gac gat cgc ttt gcc acc teg etc cac ttc etc aat get atg tea 
9750 

Asp Asp Arg Phe Ala Thr Ser Leu His Phe Leu Asn Ala Met Ser 
3205 3210 3215 

aag gtt cgc aaa gac ate caa gag tgg aaa ccg tea act gga tgg 

9795 

Lys Val Arg Lys Asp lie Gin Glu Trp Lys Pro Ser Thr Gly Trp 
3220 3225 3230 

tat gat tgg cag cag gtt eca ttt tgc tea aac eat ttc act gaa 
9840 

Tyr Asp Trp Gin Gin Val Pro Phe Cys Ser Asn His Phe Thr Glu 
3235 3240 3245 

ttg ate atg aaa gat gga aga aca ctg gtg gtt eca tgc ega gga 
9885 

Leu He Met Lys Asp Gly Arg Thr Leu Val Val Pro Cys Arg Gly 
3250 3255 3260 

cag gat gaa ttg gta gge aga get cgc ata tet eca ggg gcc gga 
9930 

Gin Asp Glu Leu Val Gly Arg Ala Arg He Ser Pro Gly Ala Gly 
3265 3270 3275 

tgg aac gtc cgc gac act get tgt ctg get aag tet tat gcc cag 

9975 

Trp Asn Val Arg Asp Thr Ala Cys Leu Ala Lys Ser Tyr Ala Gin 
3280 3285 3290 

atg tgg ctg ctt ctg tac ttc cac aga aga gac ctg egg etc atg 
10020 

Met Trp Leu Leu Leu Tyr Phe His Arg Arg Asp Leu Arg Leu Met 
3295 3300 3305 

gcc aac gee att tgc tee get gtc cet gtg aat tgg gtc ect ace 
10065 

Ala Asn Ala He Cys Ser Ala Val Pro Val Asn Trp Val Pro Thr 
3310 3315 3320 

gga aga acc acg tgg tec ate cat gca gga gga gag tgg atg aca 
10110 

Gly Arg Thr Thr Trp Ser He His Ala Gly Gly Glu Trp Met Thr 
3325 3330 3335 

aca gag gac atg ttg gag gtc tgg aac cgt gtt tgg ata gag gag 
10155 



FIGURE 12W 

Thr Glu Asp Met Leu Glu Val Trp Asn Arg Val Trp lie Glu Glu 
3340 3345 3350 

aat gaa tgg atg gaa gac aaa acc cca gtg gag aaa tgg agt gac 
10200 

Asn Glu Trp Met Glu Asp Lys Thr Pro Val Glu Lys Trp Ser Asp 
3355 3360 3365 

gtc cca tat tea gga aaa cga gag gac ate tgg tgt ggc age ctg 
10245 

Val Pro Tyr Ser Gly Lys Arg Glu Asp lie Trp Cys Gly Ser Leu 
3370 3375 3380 

att ggc aca aga gcc cga gcc acg tgg gca gaa aac ate cag gtg 
10290 

He Gly Thr Arg Ala Arg Ala Thr Trp Ala Glu Asn He Gin Val 
3385 3390 3395 

get ate aac caa gtc aga gca ate ate gga gat gag aag tat gtg 
10335 

Ala He Asn Gin Val Arg Ala He He Gly Asp Glu Lys Tyr Val 
3400 3405 3410 

gat tac atg agt tea eta aag aga tat gaa gac aca act ttg gtt 
10380 

Asp Tyr Met Ser Ser Leu Lys Arg Tyr Glu Asp Thr Thr Leu Val 
3415 3420 3425 

gag gac aca gta ctg tag atatttaatc aattgtaaat agacaatata 

10428 

Glu Asp Thr Val Leu 
3430 

agtatgeata aaagtgtagt tttatagtag tatttagtgg tgttagtgta aatagttaag 
10488 

aaaattttga ggagaaagte aggcegggaa gttcccgeca ecggaagttg agtagacggt 
10548 

gctgcetgcg aeteaaccce aggaggaetg ggtgaacaaa gecgegaagt gatccatgta 
10608 

agcectcaga acegtctegg aaggaggaec ccaeatgttg taactteaaa gcecaatgte 
10668 

agaccacget aeggcgtgct actctgegga gagtgeagtc tgcgatagtg ecccaggagg 
10728 

aetgggttaa caaaggcaaa ceaacgeccc acgeggecet agecceggta atggtgttaa 
10788 
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FIGURE 12X 

ccagggcgaa aggactagag gttagaggag accccgcggt ttaaagtgca cggcccagcc 

10848 

10908^^^^'' tgtaggtcag gggaaggact agaggttagt ggagaccccg tgccacaaaa 
caccacaaca aaacagcata ttgacacctg ggatagacta ggagatcttc tgctctgcac 
ff^oJ^''"'^'' ^''^^''^^^S^ gcgccgacaa tggtggctgg tggtgcgaga acacaggatc 

t 

11029 
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Figure 13, 5kb C5 locus and PCR primers to amplify C5 arms (SEQ ID NO: 66) 



CTGAAATTGTAATTTCTACATGTAGAGAAGGTTTTGATATTGATGGTTTTAACAGAAACG 
GACTTTAACATTAAAGATGTACATCTCTTCCAAAACTATAACTACCAAAATTGTCTTTGC 
10 20 30 40 50 60 



TAGAAATTATATCAAGGGATAACATTTTATAT6ATATAGTTTTAAAGTGTAAGATGGAAT 
ATCTTTAATATAGTTCCCTATTGTAAAATATACTATATCAAAATTTCACATTCTACCTTA 
70 80 90 . 100 110 120 

TAAATTTCATGTGCACAAGAGGCATAGGAGATAAAAGCATTTTCAGACTTTGTATAATGA 
ATTTAAAGTACACGTGTTCTCCGTATCCTCTATTTTCGTAAAAGTCTGAAACATATTACT 
130 X40 150 160 170 180 

AGGAATATGATCAAATAAACAAGAATCTGTTAGTTAGTTACTTGGATAAATTAATCGAGA 
TCCTTATACTAGTTTATTTGTTCTTAGACAATCAATCAATGAACCTATTTAATTAGCTCT 
190 200 210 220 230 240 

CGCGTGATAAAATGACTATGTACCGTTATTGCATGAACGATATTATAAATATAGGTTCTC 
GCGCACTATTTTACTGATACATGGCAATAACGTACTTGCTATAATATTTATATCCAAGAG 
250 260 270 280 290 300 

(CSAl) GGCCGAATTC 
GTAGGAGAGAACTATTGACTATGGC A A T G A A T G T T A A A T G T T A T A C T T T GGATGAAGCTA 
CATCCTCTCTTGATAACTGATACCGTTACTTACAATTTACAATATGAAACCTACTTCGAT 
310 320 330 340 350 360 

TAAATATGCATTGGAAAAATAATCCATTTAAAGAAAGGATTCAAATACTACAAAACCTAA 
ATTTATACGTAACCTTTTTATTAGGTAAATTTCTTTCCTAAGTTTATGATGTTTTGGATT 
370 380 390 400 410 420 



GCGATAATATGTTAACTAAGCTTATTCTTAACGACGCTTTAAATATACACAAATAAACAT 
CGCTATTATACAATTGATTCGAATAAGAATTGCTGCGAAATTTATATGTGTTTATTTGTA 
430 440 450 460 470 480 



AATTTTTGTATAACCTAACAAATAACTAAAACATAAAAATAATAAAAGGAAATGTAATAT 
TTAAAAACATATTGGATTGTTTATTGATTTTGTATTTTTATTATTTTCCTTTACATTATA 
490 500 510 520 530 540 
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cgtaattatttta'ctcaggaatggggttaaatatttatatcacgtgtatatctatactgt 
gcattaataaaatgagtccttaccccaatttataaatatagtgcacatatagatatgaca 

550 560 570 580 590 600 

TATCGTATACTCTTTACAATTACTATTACGAATATGCAAGAGATAATAAGATTACGTATT 
ATAGCATATGAGAAATGTTAATGATAATGCTTATACGTTCTCTATTATTCTAATGCATAA 
6X0 €20 €30 640 650 660 

TAAGAGAATCTTGTCATGATAATTGGGTACGACATAGTGATAAATGCTATTTCGCATCGT 
ATTCTCTTAGAACAGTACTATTAACCCATGCTGTATCACTATTTACGATAAAGCGTAGCA 
670 680 690 700 710 . 720 



TACATAAAGTCAGTTGGAAAGAT6GATTTGACAGATGTAACTTAATAGGTGCAAAAATGT 
ATGTATTTCAGTCA.ACCTTTCTACCTAAACTGTCTACATTGAATTATCCACGTTTTTACA 
730 740 750 760 770 780 



TAAATAACAGCATTCTATCGGAAGATAGGATACCAGTTATATTATACAAAAATCACTGGT 
ATTTATTGTCGTAAGATAGCCTTCTATCCTATGGTCAATATAATATGTTTTTAGTGACCA 
790 800 810 820 830 840 

TGGATAAAACAGATTCTGCAATATTCGTAAAAGATGAAGATTACTGCGAATTTGTAAACT 
ACCTATTTTGTCTAAGACGTTATAAGCATTTTCTACTTCTAATGACGCTTAAACATTTGA 
850 860 870 880 890 900 

ATGACAATAAAAAGCCATTTATCTCAACGACATCGTGTAATTCTTCCATGTTTTATGTAT 
TACTGTTATTTTTCGGTAAATAGAGTTGCTGTAGCACATTAAGAAGGTACAAAATACATA 
910 920 930 940 950 960 

GTGTTTCAGATATTATGAGATTACTATAAACTTTTTGTATACTTATATTCCGTAAACTAT 
CACAAAGTCTATAATACTCTAATGATATTTGAAAAACATATGAATATAAGGCATTTGATA 
970 980 990 1000 1010 1020 



ATTAATCATGAAGAAAATGAAAAAGTATAGAAGCTGTTCACGAGCGGTTGTTGAAAACAA 
TAATTAGTACTTCTTTTACTTTTTCATATCTTCGACAAGTGCTCGCCAACAACTTTTGTT 
1030 1040 1050 1060 1070 1080 

CAAAATTATACATTCAAGATGGCTTACATATACGTCTGTGAGGCTATCATGGATAATGAC 
GTTTTAATATGTAAGTTCTACCGAATGTATATGCAGACACTCCGATAGTACCTATTACTG 
1090 1100 UIO 1120 1130 UAO 
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AATGCATCTCTAAATAGGTTTTTGGACAATGGATTCGACCCTAACACGGAATATGGTACT 
TTACGTAGAGATTTATCCAAAAACCTGTTACCTAAGCTGGGATTGTGCCTTATACCATGA 
1150 1160 1170 1180 1190 1200 



CTACAATCTCCTCTTGAAATGGCTGTAATGTTCAAGAATACCGAGGCTATAAAAATCTTG 
GATGTTAGAGGAGAACTTTACCGACATTACAAGTTCTTATGGCTCCGATATTTTTAGAAC 
i9in 1220 1230 1240 1250 1260 



ATGAGGTATGGAGCTAAACCTGTAGTTACTGAATGCACAACTTCTTGTCTGCATGATGCG 
TACTCCATACCTCGATTTG6ACATCAATGACTTACGTGTTGAAGAACAGACGTACTACGC 
1270 1280 1290 1300 1310 1320 



GTGTTGAGAGACGACTACAAAATAGTGAAAGATCTGTTGAAGAATAACTATGTAAACAAT 
CACAACTCTCTGCTGATGTTTTATCACTTTCTAGACAACTTCTTATTGATACATTTGTTA 
1330 1340 1350 1360 U70 1380 



GTTCTTTACAGCGGAGGCTTTACTCCTTTGTGTTTGGCAGCTTACCTTAACAAAGTTAAT 
CAAGAAATGTCGCCTCCGAAATGAGGAAACACAAACCGTCGAATGGAATTGTTTCAATTA 
1390 1400 1410 1420 1430 1440 

TTGGTTAAACTTCTATTGGCTCATTCGGCGGATGTAGATATTTCAAACACGGATCGGTTA 
AACCAATTTGAAGATAACCGAGTAAGCCGCCTACATCTATAAAGTTTGTGCCTAGCCAAT 
1450 1460 1470 1480 1490 1500 

ACTCCTCTACATATAGCCGTATCAAATAAAAATTTAACAATGGTTAAACTTCTATTGAAC 
TGAGGAGATGTATATCGGCATAGTTTATTTTTAAATTGTTACCAATTTGAAGATAACTTG 
1510 1520 1530 1540 1550 1560 

AAAGGTGCTGATACTGACTTGCTGGATAACATGGGACGTACTCCTTTAATGATCGCTGTA 
TTTCCACGACTATGACTGAACGACCTATTGTACCCTGCATGAGGAAATTACTAGCGACAT 
1570 1580 1590 1600 1610 1620 



CAATCTGGAAATATTGAAATATGTAGCACACTACTTAAAAAAAATAAAATGTCCAGAACT 
GTTAGACCTTTATAACTTTATACATCGTGTGATGAATTTTTTTTATTTTACAGGTCTTGA 
1630 1640 1650 1660 1670 1680 



GGGAAAAATTGATCTTGCCAGCTGTAATTCATGGTAGAAAAGAAGTGCTCAGGCTACTTT 
CCCTTTTTAACTAGAACGGTCGACATTAAGTACCATCTTTTCTTCACGAGTCCGATGAAA 

1690 1700 1710 1720 1730 1740 ^ 
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TCAACAAAGGAGCAGATGTAAACTACATCTTTGAAAGAAATGGAAAATCATATACTGTTT 
AGTTGTTTCCTCGTCTACATTTGATGTAGAAACTTTCTTTACCTTTTAGTATATGACAAA 
1750 1760 1770 1780 1790 1800 



TGGAATTGATTAAAGAAAGTTACTCTGAGACACAAAAGAGGTAGCTGAAGTGGTACTCTC 
ACCTTAACTAATTTCTTTCAATGAGACTCTGTGTTTTCTCC ATCGACTTCACCATGAGAG 

1820 1830 1840 1850 i860 

C5 GRF 

METGLNAaJASPASPCYSaUAIAAa;SERWGGtZJII£imi£UTOASPHffi 
AAAATGCAGAACGATGACTGCGAAGCAAGAAGTAGAGAAATAACACTTTATGACTTTCTT 
TjrXTACGTCTTGCTACTGACGCTTCGTTCTTCATCTCTTTATTGTGAAATACTGAAAGAA 
^870 1880 1890 1900 1910 1920 

^CCATGCACTGATTAATCGAp^TT TTfllcCATGdGCCC {C5B1)_ 



SaiCffiARGUBASPJ«GiCTn£METMETVRLIIEAaJAaJSERASPII£A^ 
AGTTGTAGAAAAGATAGAGATATAATGAT6GTCATAAATAACTCTGATATTGCAAGTAAA 
TCAACATCTTTTCTATCTCTATATTACTACCAGTATTTATTGAGACTATAACGTTCATTT 
1^30 1940 1950 I960 1970 1980 

C5BAaiAaiiaffiIHJASPIBUFHEiaSARGII£\M,iaBASNOTI^ 
TGCAATAATAAGTTAGATTTATTTAAAAGGATAGTTAAAAATAGAAAAAAAGAGTTAATT 
ACGTTATTATTCAATCTAAATAAATTTTCCTATCAATTTTTATCTTTTTTTCTCAATTAA 

2000 2010 2020 2030 2040 

CffiAIGVftLLYSII£II£HrSLreiI£IEJiaSHEII£AaiTHRH^ 
TGTAGGGTTAAAATAATACATAAGATCTTAAAATTTATAAATACGCATAATAATAAAAAT 
ACATCCCAATTTTATTATGTATTCTAGAATTTTAAATATTTATGCGTATTATTATTTTTA 
2050 2060 2070 2080 2090 2100 

(C5C1) IGGATCdCGGGfrTTTTAHGA CTAGTTA ATCACGGCCG 
ARG IHJmiaJLHJJROSERGUJ XlEViSPt^LYSIlEmETmTYR UU \ THR TO Lffi 

agattatacttattaccttcagagataaaatttaagatatttacttatttaA cttataaa 

TCTAATATGAATAATGGAAGTCTCTATTTTAAATTCTATAAATGAATAAATTGAATATTT 
2110 2120 2130 2140 2150 2160 

ASP IHJ US C5S HE HE SER Lffi *** 

GATCTAAAATGCATAATTTCTAAATAATGAAAAAAAGTACATCATGAGCAACGCGTTAGT 
CTAGATTTTACGTATTAAAGATTTATTACTTTTTTTCATGTAGTACTCGTTGCGCAATCA 
2170 2180 2190 2200 2210 2220 



ATATTTTACAATGGAGATTAACGCTCTATACCGTTCTATGTTTATTGATTCAGATGATGT 
TATAAAATGTTACCTCTAATTGCGAGATATGGCAAGATACAAATAACTAAGTCTACTACA 
2230 2240 2250 2260 2270 2280 
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TTTAGAAAAGAAAGTTATTGAATATGAAAACTTTAATGAAGATGAAGATGACGACGATGA 
AAATCTTTTCTTTCAATAACTTATACTTTTGAAATTACTTCTACTTCTACTGCTGCTACT 
2290 2300 2310 2320 2330 2340 

) 

TTATTGTTGTAAATCTGTTTTAGATGAAGAAGATGACGCGCTAAAGTATACTATGGTTAC 
AATAACAACATTTAGACAAAATCTACTTCTTCTACTGCGCGATTTCATATGATACCAATG 
2350 2360 2370 2380 2390 2400 



AAAGTATAAGTCTATACTACTAATGGCGACTTGTGCAAGAAGGTATAGTATAGTGAAAAT 
TTTCATATTCAGATATGATGATTACCGCTGAACACGTTCTTCCATATCATATCACTTTTA 
2410 2420 2430 2440 2450 2460 



GTTGTTAGATTATGATTATGAAAAACCAAATAAATCAGATCCATATCTAAAGGTATCTCC 
CAACAATCTAATACTAATACTTTTTGGTTTATTTAGTCTAGGTATAGATTTCCATAGAGG 
2470 2480 2490 2500 2510 2520 



TTTGCACATAATTTCATCTATTCCTAGTTTAGAATACTTTTCATTATATTTGTTTACAGC 
AAACGTGTATTAAAG TAGATAAGGATCAAATCTTATG AAAAGTAATATAAACAAATGTCG 
2530 2540 2550 \ 2560 2570 2580 

\ G ACGTCGG (C5D1 ) 

TGAAGACGAAAAAAATATATCGATAATAGAAGATTATGTTAACTCTGCTAATAAGATGAA 
ACTTCTGCTTTTTTTATATAGCTATTATCTTCTAATACAATTGAGACGATTATTCTACTT 
2590 2600 2610 2620 2630 2640 



ATTGAATGAGTCTGTGATAATAGCTATAATCAGAG'AAGTTCTAAAAGGAAATAAAAATCT 
TAACTTACTCAGACACTATTATCGATATTAGTCTCTTCAAGATTTTCCTTTATTTTTAGA 
2650 2660 2670 2680 2690 2700 



AACTGATCAGGATATAAAAACATTGGCTGATGAAATCAACAAGGAGGAACTGAATATAGC 
TTGACTAGTCCTATATTTTTGTAACCGACTACTTTAGTTGTTCCTCCTTGACTTATATCG 
2710 2720 2730 2740 2750 2760 



TAAACTATTGTTAGATAGAGGGGCCAAAGTAAATTACAAGGATGTTTACGGTTCTTCAGC 
ATTTGATAACAATCTATCTCCCCGGTTTCATTTAATGTTCCTACAAATGCCAAGAAGTCG 
2770 2780 2790 2800 2810 2820 



TCTCCATAGAGCTGCTATTGGTAGGAAACAGGATATGATAAAGCTGTTAATCGATCATGG 
AGAGGTATCTCGACGATAACCATCCTTTGTCCTATACTATTTCGACAATTAGCTA6TACC 
2830 2840 2850 2860 2870 
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AGCTGATGTAAACTCTTTAACTATTGCTAAAGATAATCTTATTAAAAAAAAATAATATCA 
TCGACTACATTTGAGAAATTGATAACGATTTCTATTAGAATAATTTTTTTTTATTATAGT 
2890 2900 2910 2920 2930 2940 



CGTTTAGTAATATTAAAATATATTAATAACTCTATTACTAATAACTCCAGTGGATATGAA 
GCAAATCATTATAATTTTATATAATTATTGAGATAATGATTATTGAGGTCACCTATACTT 
2950 2960 2970 2980 2990 3000 



CATAATACGAAGTTTATACATTCTCATCAAAATCTTATTGACATCAAGTTAGATTGTGAA 
GTATTATGCTTCAAATATGTAAGA6TAGTTTTAGAATAACTGTAGTTCAATCTAACACTT 
3010 3020 3030 3040 3050 3060 



AATGAGATTATGAAATTAAGGAATACAAAAATAGGATGTAAGAACTTACTAGAATGTTTT 
TTACTCTAATACTTTAATTCCTTATGTTTTTATCCTACATTCTTGAATGATCTTACAAAA 
3070 3080 3090 3100 3110 3120 



ATCAATAATGATATGAATACAGTATCTAGGGCTATAAACAATGAAACGATTAAAAATTAT 
TAGTTATTACTATACTTATGTCATAGATCCCGATATTTGTTACTTTGCTAATTTTTAATA 
3130 3140 3150 3160 3170 3180 



AAAAATCATTTCCCTATATATAATACGCTCATAGAAAAATTCATTTCTGAAAGTATACTA 
TTTTTAGTAAAGGGATATATATTATGCGAGTATCTTTTTAAGTAAAGACTTTCATATGAT 
3190 3200 3210 3220 3230 3240 



AGACACGAATTATTGGATGGAGTTATAAATTCTTTTCAAGGATTCAATAATAAATTGCCT 
TCTGTGCTTAATAACCTACCTCAATATTTAAGAAAAGTTCCTAAGTTATTATTTAACGGA 
3250 3260 3270 3280 3290 3300 



TACGAGATTCAGTACATTATACTGGAGAATCTTAATAACCATGAACTAAAAAAAATTTTA 
ATGCTCTAAGTCATGTAATATGACCTCTTAGAATTATTGGTACTTGATTTTTTTTAAAAT 
3310 3320 3330 3340 3350 3360 



GATAATATACATTAAAAAGGTAAATAGATCATCTGTTATTATAAGCAAAGATGCTTGTTG 
CTATTATATGTAATTTTTCCATTTATCTAGTAGACAATAATATTCGTTTCTACGAACAAC 
3370 3380 3390 3400 3410 3420 



CCAATAATATACAACAGGTATTTGTTTTTATTTTTAACTACATATTTGATGTTCATTCTC 
GGTTATTATATGTTGTCCATAAACAAAAATAAAAATTGATGTATAAACTACAAGTAAOAG 
3430 3440 3450 3460 3470 3480 
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TTTATATAGTATACACAGAAAATTCATAATCCACTTAGAATTTCTAGTTATCTAGTTTTT 

AAATATATCATATGTGTCTTTIAAGTATTAGGTGAATCTTAAAGATCAATAGATCAAAAA 
3490 



3500 



3510 



3520 



3530 



3540 



CTAGAATATTGTACTTTATTTCTAATGGAATGGCTCTCCAGCCTAGTAATTTATTAATGT 

STCGGATCATTAAATAATTACA 



3550 



3560 



3570 



3580 



3590 



3600 



TAGCTGATATCTTGAAATCAGGATATTCTGCTCCGTGAAGAGAAAGTCCTCCAAAGTTOT 
ATCGACTATAGAACTTTAGTCCTATAAGACGAGGCACTTCTCTTTCAGGAGGTTTCAACA 

^20 3630 3640 3650 3660 



ATATTTCCATCACTTTCATGGCTTCCTCTTTCCATAGTGTCTTCTATAAGCTGTCTATAT 

TATAAAGGTAGTGAAAGTACCGAAGGAGAAAGGTATCACAGAAGATATTCGACA6ATATA 
3670 



3680 



3690 



3700 



3710 



3720 



ATTGTAAACTTTTCTGGTTTTATGCATTTTAAACATTTAGCAATCTCATTTTCATCACAA 
3730 3740 



3750 



3760 



T AAAAGTAGTGTT 
3770 3780 



TTAAGGCACAAATCTAACATGGAATGTCTACCATAACCCAATAAGGTTTTTTCATTTCCT 

AATTCCGTGTTTAGATTGTACCTTACAGATGGTATTGGGTTATTCCAAAAAAGTAAAGGA 
3790 



3800 



3810 



3820 



3830 



3840 



CTATCTCTAATACACACTGTTCTTTCCAGACTTTCAACACGCTGCTATTTTCTATTTTAT 
GATAGAGATTATGTGTGACAAGAAAGGTCTGAAAGTTGTGCGACGATAAAAGATAAAATA 
'''' 3870 3880 3890 3900 

TCAAGTCCATATTATAAGCGTCCTTGTTAGACACTTCATAATGTTTGCATTCTGGAATCA 
AGTTCAGGTATAATATTCGCAGGAACAATCTGTGAAGTATTACAAACGTAAGACCTTAGT 

3940 3950 3960 



TCATGTTAGATACTATTAATTTAGCTACTTCTATGTTGTCATCAAAAGAGTTGCTATCTG 

AGTACAATCTATGATAATTAAATCGATGAAGATACAACAGTAGTTTTCTCAACGATAGAC 
3970 3980 



3990 



4000 



4010 



4020 




ATTAATGTGATT 
4030 



TCTGAATT 



4040 



4050 



4060 



4070 



4080 
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TAAGCAATACCTCAATAACTTCTTTTGAAGATGACTTTGCAGCTAAAAATAATGGAGTTC 
ATTCGTTATGGAGTTATTGAAGAAAACTTCTACTGAAACGTCGATTTTTATTACCTCAAG 
4090 4100 4110 4120 4130 4140 



TCTCTAAAACATCCCTTGAGTTTATATTAGCCCCGTAACTAATGAGTAGTTCTGTTATAT 
AGAGATTTTGTAGGGAACTCAAATATAATCGGGGCATTGATTACTCATCAAGACAATATA 
4150 4160 4170 4180 4190 4200 



CTTTGGAATCTATTGATATATTATTAATTACAATTGTCATGCTGACATATATAGACATCA 
GAAACCTTAGATAACTATATAATAATTAATGTTAACAGTACGACTGTATATATCTGTAGT 
4210 4220 4230 4240 4250 4260 



TAATATGATGAAAAATATGAAAATATAAGTGCACGTTTACTGTTACTATGATTGTGATAT 
ATTATACTACTTTTTATACTTTTATATTCACGTGCAAATGACAATGATACTAACACTATA 
4270 4280 4290 4300 4310 4320 



CGATATGAGTTCTTTAATAAAAGTACTGAAATAGATATAATGCAGATATGATTGATATTT 
GCTATACTCAAGAAATTATTTTCATGACTTTATCTATATTACGTCTATACTAACTATAAA 
4330 4340 4350 4360 4370 4380 



TAAAAAGTTGAAAAAAAATATGCCCTGTTTACAAATACTATTTGGAAATATTCTGTAATA 
;^Y.pT.pTC;^;^Crj.TTTTTTTATACGGGACAAATGTTTATGATAAACCTTTATAAGACATTAT 
4390 4400 4410 4420 4430 



AAGTAATAGTGATATGTCAGTCACGATGGATTTGCCAATTGATCATATGAGTATAGATAA 
TTCATTATCACTATACAGTCAGTGCTACCTAAACGGTTAACTAGTATACTCATATCTATT 
4450 4460 4470 4480 4490 4500 



CATAAACGAGTATAATAAAAATGGATATACTAGACTCTATATAGAGGTAGCCATGAAAAA 
GTATTTGCTCATATTATTTTTACCTATATGATCTGAGATATATCTCCATCGGTACTTTTT 
4510 4520 4530 4540 4550 4560 



ACGTAAAAACGTAGATAGACTTTTATATCTCGGAGCTGATCCGAATCTGGCTAGTGTAGA 
TGCATTTTTGCATCTATCTGAAAATATAGAGCCTCGACTAGGCTTAGACCGATCACATCT 
4570 4580 4590 4600 4610 4620 



TTCGTATTGTCCTCTTCATATTGCTGTTAGGAATGGTAGTTTAAAGATAATAAGATCATT 
AAGCATAACAGGAGAAGTATAACGACAATCCTTACCATCAAATTTCTATTATTCTAGTAA 
4630 4640 4650 4660 4670 4680 
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GTTGAAATATGGTGCTAATATAAATCAAGAATGTCATGAAGGAGATACTGCTTTGATOAT 

CAACTTTATACCACGATTATATTTAGTTCTTACAGTACTTCCTCTATGACGAAACTACTA 

*T00 4710 4720 



4730 4740 



GGCTATATCATTAGGTAATTATACAGCATGTAAAACACTTCTAGATAACAACGCCGATCC 
CCGATATAGTAATCCATTAATATGTCGTACATTTTGTGAAGATCTATTGTTGCGGCTAGG 



««» 4770 4780 



4790 4600 



TAATTATGTTAACTATTACGGTATAGTTCCGCTTATTAGAGCAATTATATGTGAAAAG 



4810 



4820 



4830 



CC 

:tcgttaatatacacttttcg6 

4840 4850 4860 



TGACATAGTTAGACTGCTATTAGATAGAGG 
ACTGTATCA AT 



4870 



4880 



4890 



AGCTAATTGCAACCACTTAATTACAAAAAA 

TGTTTTTT 
4900 4910 4920 



GC 



AATCTCTCAGAATCCTTAACGAAAAAATTTCTGTTAAGAAG 
^«0 4960 4970 4980 



ATCATTGTCGATACTAATAT 
TAGTAACAGCTATGATTATA 
4990 5000 



00150910 



Figure 14. VQH6 amplified fragment (SEQ ID NO:76) 

VQ marker 

BamH I VIRQGENET 
AAA |GGATC(^ GGTTAATTAATTAGTCATCAGGCAGGGCGAGAACGAGACT 

TTTCCTAGGCCCAATTAATTAATCAGTAGTCCGTCCCGCTCTTGCTCTGA 

ICS*** => H6p 

ATCTGCTCGTTAATTAATTAGAGCTTCTTTATTCTATACTTAAAAAGTGA 
TAGACGAGCAATTAATTAATCTCGAAGAAATAAGATATGAATTTTTCACT 

AAATAAATACAAAGGTTCTTGAGGGTTGTGTTAAATTGAAAGCGAGAAAT 
TTTATTTATGTTTCCAAGAACTCCCAACACAATTTAACTTTCACTCTTTA 

EcoR V 

AATCATAAATTATTTCATTATCGC lGATATq CGTTAAGTTTGTATCGTAGG 
TTAGTATTTAATAAAGTAATAGCGCTATAGGCAATTCAAACATAGCAI^ 



iq;>n I Xho I Xba I Cla I Sma I 
TACC |CTCGAGf rCTAGA |ATCQAT| CCCGGGTTTT 



ATGG|GAGCT dAGATClt rAGCTA |GGGCCC| AAAA 



Figure 15. Restriction map.of pNVQH6C5LSP-18 




